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We present evidence indicating that Anosov systems can be endowed with a unique physically 
reasonable effective temperature. Results for the two paradigmatic Anosov systems (i.e., the cat 
map and the geodesic flow on a surface of constant negative curvature) are used to justify a proposal 
for extending Ruelle's thermodynamical formalism into a comprehensive theory of statistical physics 
for nonequilibrium steady states satisfying the Gallavotti-Cohen chaotic hypothesis. 
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I. INTRODUCTION 

The chaotic hypothesis of Gallavotti and Cohen [I] is that 

for the purpose of studying macroscopic properties, the time evolution map [T] of a many-particle system 
can be regarded as a mixing Anosov map [2]. 

The principal consequence of this hypothesis to date is the fluctuation theorem for deterministic reversible dynamics 
[IJ[3J. Like its analogue for stochastic dynamics [H[5], the fluctuation theorem provides a quantitative relation between 
entropy production values of equal magnitude and opposite signs through consideration of the time-reversed dynamics. 
These fluctuation relations apply even to singular systems such as particle systems obeying Lennard- Jones potentials 
[5] and fit into the mathematical framework of large-deviations theory [7] . They have been shown to generalize both 
the Onsager reciprocity relations and the Green-Kubo formulae linking fluxes and transport coefficients; as such they 
play a fundamental role in statistical physics [2j. 

In this paper we explore the chaotic hypothesis from a different perspective, presenting evidence indicating that 
Anosov systems can be endowed with an essentially unique effective temperature and a corresponding energy function 
that can be calculated directly in terms of the ergodic statistics of the time evolution. In the context of Anosov 
systems this prescription is entirely dynamical and draws on the SRB measure of the system, highlighting its physical 
relevance [5J [5]. On the other hand, the present considerations inform the choice of an appropriate timescale for 
computing the effective temperature and the scaling behavior of the effective temperature with respect to the number 
of states. 

Results for the two paradigmatic Anosov systems (i.e., the cat map and the geodesic flow on a surface of constant 
negative curvature) are used to justify a proposed framework extending Ruelle's thermodynamical formalism into a 
comprehensive theory of statistical physics for nonequilibrium steady states satisfying the chaotic hypothesis. The 
key results are a series of analytical and numerical calculations identifying nontrivial limiting behavior of the effective 
temperature under successive physically motivated refinements of phase space and culminating in sections [Xj |XIV[ 



and |XVl These combine to suggest that a generalized variational principle (likely to be recognizable as minimizing the 
effective free energy in some way; see also appendix [B]) singles out a preferred effective temperature and concomitant 
effective energy function. 

The paper is organized as follows. Background on Anosov systems and SRB measures is given in section|TTj Section 
III] serves to couch the construction of Markov partitions in a physical context and describes how to obtain a partition- 
independent energy function before discussing the role of the mixing time as the preferred characteristic time for an 
Anosov system. 

In section [TV| we briefly describe the unique effective inverse temperature of a finite system with stationary probabil- 
ities pj and characteristic time t^ that is both consistent with equilibrium statistical physics and physical constraints 
on scaling behavior. Up to a fixed choice of scale, it is given by /? = t^ \\p\\ \/||7|| 2 +l, where 7^ = i Y^=i ^°&Pj~ l°gPfc- 
By using this and closing the Gibbs relation p^ = Z~ 1 e~ l3Ek , it is possible to use the idiom of equilibrium statistical 
physics to describe the behavior of nonequilibrium steady states. Most of the paper will deal with understanding this 
approach in the context of a physically reasonable application to Anosov systems. 

Section [V] introduces the Arnol'd-Avez cat map and the classical Markov partition of Adler and Weiss before 
illustrating the first indication of nontrivial limiting behavior for the effective temperature. In section |VI| a general 
explanation for this behavior is given; section |VII| illustrates why it is nontrivial by way of an example. The implications 
of this limiting behavior for constraining the detailed form of the effective temperature in a somewhat unexpected 
way are discussed in section [VIII[ 

The issue of uniformity of Markov partitions is discussed in section |IX| and serves to provide background for a 
physically motivated class of refinements. Section [X] continues in this vein by introducing a method of obtaining 
the most uniform possible refinements of Markov partitions w/r/t the ambient Riemannian measure and identifies an 
apparently unique effective temperature for the cat map on this basis. The role of ensembles and the thermodynamical 
limit is discussed in section |XI| to provide a broader context for the ideas presented in this paper before section |XII| 
outlines the essentials of a proposal for extending Ruelle's thermodynamical formalism to a complete theory of 
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statistical physics for nonequilibrium steady states in the context of the chaotic hypothesis. The so-called Ulam 
method for approximating SRB measures via piecewise approximations on generic (non-Markov) partitions is briefly 
touched on in section [XlTTJ along with its relevance for approximating the mixing time. 

In section |XIV| a continuous-time version of the cat map is introduced and related to a Hamiltonian system before 
introducing the cat flow as the simplest Anosov flow and relating its effective statistical physics to that of the cat 
map, providing an indication that Anosov flows also appear to enjoy a uniquely determined effective temperature. 
The subject of Anosov flows is continued with the introduction of the paradigmatic example of geodesic flow on a 
surface of constant negative curvature in section |XV| and culminates in the numerical demonstration of a well-behaved 



limit for the effective temperature in section XVI 



Sections |XVII| and |XVIII| discuss the effective temperature focused on in this paper in relation to the dynamical 
temperature introduced by Rugh |10) and the fluctuation-dissipation (effective) temperature respectively. After 
the conclusion appendices arc provided on a technical lemma, the so-called variational principle, and a single Glauber- 
Ising spin as a basis for comparison of the fluctuation-dissipation temperature with the effective temperature at the 
center of our considerations. 



II. BACKGROUND ON DYNAMICAL SYSTEMS 



In this section we provide definitions and notation for the necessary mathematical background, following (e.g.) [7J 
rT^HTj] . For an overview of these concepts from the point of view of statistical physics see §9 of [2] • (A physical concept 
more generally familiar than the chaotic hypothesis but still incorporating much of the mathematical infrastructure 
introduced immediately below is furnished by Lyapunov exponents. |134j ) 

A diffeomorphism between Riemannian manifolds M and A is a map T : M N such that both T and T _1 
are smooth. Typically we will write Tx in place of T(x). A closed subset Mo of M is hyperbolic if TMq = Mq 
and the bundle of tangent spaces {T X M} X £M admits a decomposition T X M = E x © E x into stable and unstable 
subspaces varying continuously w/r/t x and such that a) the decomposition is invariant: i.e., the derivative satisfies 
D x T(E { x ' u) ) = E^ u) and b) there exist C > and < A < 1 such that \\D x T^ n ^v (s ^\\ < CX n \\v ( - s ' u '>\\ for all 
v ( s , u ) £ E X S ' U ^ and n > 0. Note that hyperbolicity does not depend on a particular choice of Riemannian metric, but 
only on the existence of a suitable one. 

A point x <E M is nonwandering iff for all U C M with x € U, there exists n > such that T n UC\U ^ 0. Denote the 
set of nonwandering points by O(T). T is Axiom A if fl(T) is hyperbolic and equals the closure of the set of periodic 
points. T is Anosov if M itself is hyperbolic. An Anosov flow is defined similarly, but with an invariant decomposition 
of the tangent bundle of the form T X M = E x © E x © E x , where the neutral direction of the flow corresponds to a 
one-dimensional family of subspaces E x . (See figure [l]) Thus the construction of Anosov diffeomorphisms and flows 
require at least two and three dimensions, respectively. 

E s 

X 




FIG. 1: Schematic local geometry of a three-dimensional Anosov flow. 

If fi is a T-invariant probability measure on M, then T is mixing if fi(X n T Y) —> n(X)fi(Y) for measurable 
sets X and Y . While mixing is a very strong statistical property of a measure-preserving dynamical system, in the 
special case of Anosov diffeomorphisms it appears to reduce to a technical assumption for practical purposes, and 
one that is satisfied in all known cases when M is connected. (There are Anosov flows that are not mixing, and 
we shall actually encounter the simplest such flow in this paper; however, the assumption of mixing still appears to 
be technical in nature.) Typically we will denote the normalized Riemannian volume on M by i/. The normalized 
Liouville measure on the unit tangent bundle is given by a product of the normalized Riemannian volumes on tangent 
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spheres UT X M := {v e T X M : \\v\\ = 1} and on M. A dynamical approach to the Liouville measure adapted to the 
present context is given in |15j . 

The chaotic hypothesis was motivated by Ruelle's elucidation of a thermodynamical formalism for Axiom A systems 
that provides a great deal of explanatory power for the study of nonequilibrium steady states [16] . In particular, the 
theory for mixing Anosov systems gives the existence of a unique T- invariant probability measure (J-srb, called the 
Sinai-Ruelle-Bowen or SRB measure [17] and satisfying lim^oo kr 1 X^=o F{T^x) = J M F d^sRB for ^-almost all x 
when F is continuous. [135J 

Because in general hsrb is singular w/r/t v, a naive density of the form p = d^sRB/dv typically fails to be well- 
defined. Consequently, a naive generalization of the formula for the inverse effective temperature in which sums are 
replaced by integrals w/r/t v also typically fails to be well-defined. Much of this paper is implicitly concerned with 
overcoming obstacles raised by this fact, which is specifically addressed in more detail in section [VIII| 

The existence of a unique SRB measure generalizing the microcanonical ensemble, along with the chaotic hypothesis 
and the associated fluctuation theorem, serves to indicate the relevance of the theory of Anosov systems to statistical 
physics. The expressiveness of the theory is due chiefly to the consequences of a local product structure on M 
inherited from the decomposition into stable and unstable tangent spaces. A brief sketch will serve to introduce the 
basic constructions of interest in this regard. 

Let T be an Anosov diffeomorphism. For x <E M and U C M, let W^ s ' u \x) respectively denote the global stable 
and unstable manifolds containing x, let We S ' u \x) denote local stable and unstable manifolds (defined respectively 
as the sets {y e M : d{T^'^x, T^'^y) < e, Vn > 0}), and let W^ s ^(x, U) denote the intersection of U and local 
stable and unstable manifolds that minimally stretch across U. A subset R of M is called a rectangle if its diameter 
is sufficiently small that for any x,y € R there exists e such that [x,y] := Wj?(x) n W™(y) consists of a single point 
also contained in R. The local product structure of a rectangle R is illuminated by taking z £ R: it happens that 
R = {[£,yl : x £ W u (z,R),y £ W s (z,R)} and there is a unique representation of the form x = [x",a; s ], where 
x (s,u) e w( s ' u \z,R). 

A collection of subsets TZ — . . . , R n } is called a partition of M if the Rj have pairwise disjoint interiors and 
[J • Rj — M. If the set Rj equals the closure intRj of its interior it is called proper. We shall generally not concern 
ourselves with ambiguities associated with the boundary dlZ, as it has (Ricmannian and SRB) measure zero and thus 
no physical significance. With this in mind, write 7Z(x) for an element of TZ containing x (TZ(x) is almost surely 
unique). A partition TZ — {Ri, ■ ■ ■ , R n } of M into proper rectangles is called Markov if 

TW s (x,Rj) C W s (Tx,R k ); W u (Tx,R k ) C TW u {x,R 3 ) (1) 

for x G int-Rj n T~ 1 int.R fe . It can be shown that M admits Markov partitions of arbitrarily small diameter. 

The Markov criteria means that the forward and inverse images of each rectangle respectively stretch across the 
unstable and stable directions in such a way that the stable and unstable boundaries of these images are contained 
within the stable and unstable boundaries of other rectangles. Markov partitions can be regarded as coarse-grained 
phase space cells well-suited for describing (a suitable time discretization of) physical dynamics, and particularly from 
the point of view of statistical physics. A suitable diameter for physical coarse-graining is one for which nontrivial 
intersections of the form intRj H T _1 inti?fc are connected and for which the physically interesting observables are 
approximately or effectively constant on the partition elements. 

A similar notion of Markov sections applies to Anosov flows. Details can be found in [14] , 

The key property of Markov partitions for the ergodic theory of Anosov systems is their correspondence to simple 
and natural symbolic dynamical encodings. Given a partition 1Z = {Ri, ■ ■ ■ , R n } of M, the entries a(7Z)jk of the 



corresponding transition matrix (not to be confused with a stochastic matrix, although see section XIII| for a related 
construction in this vein) are equal to or 1 if int-Rj PI T _1 intf?fc is respectively empty or nonempty. The dynamics 
of T w/r/t a Markov partition 1Z is captured by the suh shift of finite type or topological Markov chain 

S„ W := {s e {1, . . . , n} z : a(H). t ,, j+1 = 1} . (2) 

For s £ £ a (TC) * the set ir(s) :— f} ke % T~ k R Sk consists of a single point and n o £ — T o ir, where I denotes the left shift 
operator on S (-r\ that sends Sj to Sj+i. That is, points in M and symbolic sequences in S a (K) are in a well-behaved 
one-to-one correspondence (except on a set of measure zero corresponding to the images of the partition boundary) 
called a topological conjugacy. [136] 

This correspondence-and in particular the simple description of the symbolic sequences encoding the dynamics-is 
the payoff for the framework described above. A Markov partition (which can be realized in settings more general 
than Anosov or even Axiom A systems) enables the reformulation of dynamics in terms of a one-dimensional spin 
model [18] with short-range interactions in which the permitted spin configurations are specified by the transition 
matrix (a/k/a a nearest neighbor "hard core" interaction). In this context the SRB measure of the original system 
yields a Gibbs measure of the corresponding spin system. 
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This point of view immediately leads to nontrivial consequences which further serve to indicate the physical relevance 
of the Markov description of dynamics: for example, a unique SRB measure corresponds to the absence of phase 
transitions in one-dimensional short-ranged spin models [2]. By the same token, it suggests the construction of d- 
dimensional lattices of coupled maps corresponding to {d + l)-dimensional spin systems capable of exhibiting phase 
transitions |19j . However realizing such phenomena in the Anosov context will probably require either strong coupling 
and projections onto a subsystem, or perhaps both (see section XI for details). 

In this work we focus on aspects of the two paradigmatic Anosov systems: the Arnol'd-Avez cat map and the 
geodesic flow on a surface of constant negative curvature. 



III. A GEDANKENEXPERIMENT 



Before proceeding with particular systems, however, we shall make some general observations about the effective 
statistical physics of finite systems, and in particular the "experimental" character of the framework. Towards 
that end, consider the following simple classical Gedankenexperiment on a system represented by a mixing Anosov 
diffeomorphism T . 

The experimenter's access to the system (including the underlying manifold M) is limited to an "oracle" that 
accepts input x and an integer k and returns T k x with a resolution of e. That is, the experimenter can evolve the 
system perfectly, but she can only distinguish points (including initial conditions) that are at least a distance e apart. 

Assuming henceforth that M is compact and finite-dimensional, the experimenter can approximate a (sufficiently) 
regular partition X = {X{\ of M into rectangular microcells of nearly equal Ricmannian measure. She can also use 
the information gained from the oracle to approximate a suitable Markov partition TZ to a degree of accuracy limited 
only by her patience (as measured by the number of evaluations x Tx she performs) and e by adapting the original 
construction of Markov partitions by Sinai (see [T3] for details). For reasons that will become clear, it will make sense 
to consider both a microscopic regular partition and a mesoscopic or coarse-grained Markov partition simultaneously. 

The procedure to approximate the Markov partition begins with an initial partition 1Z$ into smooth connected 
(approximate) rectangles constructed by considering the action of T on suitable local stable and unstable manifolds 
through a set of points such that balls of small radius around them cover M. In order to form an (approximate) 
Markov partition, the stable and unstable boundaries d^ s ' u 'TZo (which are defined in the obvious way) are perturbed 
along the unstable direction along with relatively small expansions or contractions in such a way as to preserve the 
rectangular property. A new partition 1Z\ is defined so that Td s 7Zi C 8 s TZq and T~ 1 d u TZ\ C d u 7Zo. Now it turns 
out that the conditions Td s lZ C d s 7Z and T~ x d u lZ C d u 7Z imply that TZ is Markov. By an iteration of the procedure 
just outlined, therefore, a sequence {TZk} of partitions can in principle be constructed that converges exponentially 
quickly to a Markov partition. 

Because our experimenter is careful, she will have taken pains to establish that she has approximated TZ at the best 
possible resolution e (which we assume is comparable to the diameter of X and small compared to the diameter of 
the coarse Markov partition 1Z), so that in the overwhelming number of cases she can reliably obtain the partition 
elements {TZ(T k Xo)} corresponding to the initial condition x^, and in most cases even the elements {X(T k xo)} (and 
at worst one of the neighboring microcells, which will be functionally equivalent). Because the experimenter has no 
direct knowledge of T, she cannot compute anything from first principles, but instead must rely only on symbolic 
data of this form. 

Since Markov partitions are not unique, the experimenter will typically consider X to provide a more physically 
fundamental discretization than 1Z. Indeed, as we shall see in section pCIII her best (but still not unique) strategy for 
constructing a suitable 1Z will be to take its diameter as large as possible while keeping the SRB measures of elements of 
X that intersect Rj indistinguishable (up to a desired tolerance, which introduces a free but untroublesome parameter). 
It is nevertheless the case that Markov partitions are physically relevant, and not least for the reasons outlined in 
section [TTJ though it is the case that any particular Markov partition cannot be considered physically relevant in 
isolation. |137j 

Write p!^ (x) := A; -1 X^=o (T e x) for the empirical probability distribution. After some characteristic 
timcscale that is independent of the initial condition, the experimenter will almost surely be able to conclude that 
Y^i-XiDR jtBPi ( x o) i s converging to some value, which she will (be able to approximately) identify as hsrb(Rj)- At 
this time she stops (paying attention to) the evolution since as we shall see below this is a sufficiently long interval to 
extract an appropriate too. 

Under the physically reasonable assumptions above, she can obtain the probabilities and effective energies of the 
microcells straightforwardly. For the moment, write r and x for indices corresponding to 1Z and X, respectively. 
Assume that v x = v(X x ) = 1/\X\ and v r = v(R r ) = N r /\X\, and write [i x = fJ-SRB{X x ) = e~ lx /Z and fl r = 
HSRB{Rr) = e~ 7r jZ. Now provided that 1Z is (as it should be) small enough so that any interesting observables 
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are practically constant on its rectangles, we have that fix/^x ~ Mr(i)/^(i)i where X x n R r ( x ) 7^ equivalently, 
j x sa 7 r ( x ) + logJV,.^). We can use this equality to define 

7r(x) := lr(x) + logiV r ( x ) (3) 

and after writing E :— /3 _1 7 it follows that E r t x \ s» provided that /3 does not depend on 1Z or A" (as is the case 
for the situations of interest in this paper). Therefore nothing significant is lost by restricting attention to a Markov 
partition in general, and given (3 we can in principle reconstruct the corresponding effective energy function to an 
accuracy limited only by s. (Note also that if 2_7 r = 0, then 5^_7 r = logj ] , so we might in this event consider 
:= 7 r — log|«Y|/log|7£|, which again sums to zero.) 

In the case where hsrb = v (which is the case with the systems considered explicitly in this paper) ^ x = 0. However, 
since the more interesting case ^srb v is presently infeasible to examine in any detail, we do not bother with any 
such rescaling in the remainder of this paper (though we remark here that it appears to be ultimately irrelevant 
for the computation of the physically preferred inverse effective temperature). In particular, we will exhibit typical 
Anosov systems for which the effective temperature and spectral density (i.e., the energy levels, with only relative 
multiplicities accounted for) appear to have unique well-defined limits as a function of certain types of partition 
refinements. This result will inform most of our discussion throughout. 

The a priori identification of an appropriate characteristic time t x is perhaps the most difficult conceptual issue 
the experimenter faces. One reason for this is that several parameters that can be interpreted as timescales to consider 
arise rather naturally (though some may depend on the partition) . Among these timescales are the inverse topological 
entropy (see appendix [B]), the number of elements in the partition |138j . or the number of timesteps necessary for 
any two elements of 1Z to communicate, i.e. the least integer k s.t. a(lZ) k has strictly positive entries [20] ■ Like 
all of the candidates mentioned except the inverse topological entropy, a Poincare or recurrence time is evidently 
not an appropriate choice in the present context because it is far from intrinsic. To see this note that a particular 
discretization scale of the phase space is necessary for a Poincare time to be defined, but for a system such as the 
geodesic flow on a surface of constant negative curvature there is no single preferred finite discretization scale by the 
remarks above on effective energies. 

However there is another particular timescale that appears to be entirely appropriate for the present purposes, 
namely the so-called mixing time of the system. The relaxation time is generally similar and may also be considered. 
[139) See also section XIII for obtaining such a via the Ulam method. By replacing sets with characteristic 
functions and making an obvious generalization of scope, the property of mixing can be considered from the point 
of view of the time correlation function Cf,g(^) := J F ' {G o T k ) d[i — J F d/i ■ J G d/j,. Ruelle j2H and Sinai [2"2"] 
showed that Cp^ik) decays exponentially if T is Axiom A with dense unstable manifolds and a connected attractor 
(these criteria are conjectured to be satisfied for generic Anosov diffeomorphisms when M is connected; see also [23] 
for an extension to partially hyperbolic systems). For more detailed results in the particular case of hyperbolic toral 
autmorphisms (i.e., generalized cat maps discussed in section [v]), see [231 [25] , 

An analogous result for Anosov flows remained unproven for more than two decades until Chernov 26 utilized what 



amount to Ulam approximations (see section XIII I in the process of demonstrating decay bounded by an exponential 
in \ft for the time correlation function for (e.g.) geodesic flows on compact surfaces of variable negative curvature. 
The application of the Ulam method in this context rests on the construction of a Markov partition of small diameter 
(which can be done by refining any initial Markov partition by images under T) and subsequent approximation of /j, 
on a rectangle by a product measure with accuracy controlled (essentially) by the diameter of the small partition [27] . 
These results were later improved using spectral methods by Dolgopyat [35] and Liverani [39] to show exponential 
decay of Cf,g for the cases of clear physical relevance. 

It is natural to expect on purely physical grounds that the time correlation (which tends to zero iff /i is mixing) 
will decay exponentially. Although there are examples of Axiom A flows with arbitrarily slow decay of correlations, 
this expectation is nevertheless valid for the known physically relevant examples and conjectured to be valid for 
arbitrary mixing Anosov flows (it is known that the decay is superpolynomial in this case) [14] . Besides accounting 
for cases of obvious interest here, this circle of results indicates that we can use the decay rate or mixing time of 
the time correlation function-which depends only on the Anosov diffeomorphism or flow in all known cases-as a 
natural dynamically intrinsic timescale that is independent of the Markov partition, unlike most otherwise plausible 
candidates for t^, . The approximation of the mixing time is briefly touched upon in section |XIII| 

So now our experimenter has not only a natural timescale t^ but also for each initial condition a symbolic sequence 
of (approximate) partition elements, and she can form from the frequencies of symbols an empirical probability 
associated with each partition element. This is a gigantic amount of information, and it is only natural to assume 
that if T represented the dynamics of some equilibrium system, then our experimenter could in principle go beyond 
the construction of an (approximate) SRB measure to provide a full characterization of the Gibbsian statistical 
physics, including an effective temperature and an effective energy spectrum. Taking the idea that "there is no 



7 



conceptual difference between stationary states in equilibrium and out of equilibrium" [15] to its natural conclusion, 
the experimenter should be able to provide an effective temperature and energies in either case. 

This idea was anticipated in [3U], where the rates of expansion and contraction under the action of T were said to 
"provide an 'energy function' that assigns relative probabilistic weights to the coarse grained cells," and in fact was 
followed up with a proposal to define an effective temperature of a thermostat keeping the system in a stationary 
nonequilibrium state by W/S, where W is the work rate of external forces on the system and S is the entropy 
production rate (see also |31j). However, the proposal of |30j still requires reference to a predefined energy function 
of some sort in order to define a sensible notion of work rate. 



IV. THE EFFECTIVE TEMPERATURE 



There is a quick derivation of the Gibbs distribution for a finite system from the basic postulate that the probability 
of a state depends only on its energy. This approach has the benefit of motivating the construction of the effective 
temperature presented later in the section. While this approach does not motivate the introduction of entropy, the 
standard information-theoretic motivation provides an adequate remedy. We sketch the derivation here and note that 
it can be made rigorous without substantial difficulty. 

The key observation is that energy is only defined up to an additive constant. This and the basic postulate imply 
that 

P{Ek) ~ Y-JW) ~ E~fP^) (4) 



for some function / and e arbitrary. Define 



9e{s) ■= ik JTE, v (5) 



and note that g E (0) = 1 by definition. It follows that 



m /ei \ f{Ek) ,s f{E k + e) (R s 

V[Ek) = E 3 f(E 3+ e) 9E{£) = E 3 m + eY (6) 

Therefore f{E k )-g E (e) = f{E k +e), implying that f(E k + e) - f(E k ) = (g E (e) - 1) ■ f{E k ). In turn, since g E (0) = 1, 
we obtain f'(E k ) = g' E (0) ■ f(E k ). 
As a result, 

f(E k ) = Cexptf E {0)E k ). (7) 

Setting, without loss of generality, f3 a := —g' E (0) and C = 1 produces the Gibbs distribution so long as the "ordinary" 
temperature is defined via T := l/ksPo- Note also that g E (e) = exp(— (3 e) so that g E = g, as required for the 
self-consistency of the argument. Although the derivation here is only appropriate for a fixed j3 Q , this amounts to the 
canonical ensemble. 

Besides the naturalness of using an invariance principle, there is a good reason to treat the notion of temperature 
as central to the derivation of the Gibbs distribution in lieu of entropy. Namely, entropies can be introduced, defined, 
and applied in much broader contexts than temperature, precisely because of the abstract and general nature of the 
entropy concept. Temperature, on the other hand, has no obvious information-theoretical interpretation, and often its 
physical interpretation is nontrivial. It is nevertheless generally expected that the kinetic temperature (which serves 
as a convenient operational parameter in general physical situations by considering weak coupling) can be bijectively 
associated with the "true" temperature, though they may not be equal [32]. The present paper aims to broaden 
the scope of applicability of (effective) temperature but does not attempt to supply any information-theoretical 
interpretation. 

With the preceding arguments in mind, we pause briefly to consider some general implications of a widely applicable 
effective temperature before reviewing its derivation. A stationary (or sufficiently slowly varying) physical system 
with n < oo states is typically described in terms of its state energies and inverse temperature whenever possible. 
While this description greatly aids the theorist armed with a Hamiltonian in making predictions about the statistical 
behavior of simple systems in equilibrium, in practice many systems of interest are too complex or far from equilibrium 
to admit easy characterization in terms of their microscopic dynamics. They "are governed by emergent rules. This 
means, in practice, that if you are locked in a room with the system Hamiltonian, you can't figure the rules out in 
the absence of experiment" [33 . 



Such situations are remedied by the identification of effective theories. With a physically reasonable effective 
temperature determined in terms of experimentally accessible data, an experimenter can close the Gibbs relation 
to arrive at an effective energy function. This procedure appears to offer the prospect of formulating good effective 
physical theories for many systems beyond the scope of present methods. The present paper will develop this technique 
in the context of the two paradigmatic Anosov systems and also indicate how it can be applied more generally, including 
to manifestly nonequilibrium systems. 

We now give a brief summary of the derivation presented in [341 135] and drawing on earlier work in [361 137] . 

The behavior of a stationary system with n < oo states and characteristic timescale (here, a mixing or relaxation 
time) too and occupation probabilities pj may equivalently be described in terms of t := t^p = (ti,... ,t n ). Write 
H = (Ei, . . . , E n , for a tuple of notional state energies appended with an effective temperature. Now the typical 
goal of equilibrium statistical physics is to produce predictions along the lines H i— > p using the Gibbs relation 
Pj = Z~ 1 e~P Ej . While this map is obviously not invertible and too does not enter the usual equilibrium picture 
explicitly, as we shall sketch below there is an essentially unique physically reasonable bijection between t and H, and 
the map t H> H is also of interest for purposes of characterization. 

To construct this map, first (w/l/o/g) reset the zero point of energy so that J^j Ej = (with a trivial adjustment, 
we can later set J^j Ej to any constant, e.g. note also that this does not place any meaningful constraint on 

the effective internal energy J^jPjEj) an d invoke the Gibbs relation to obtain 



1 " 

/3E k = - V logp,- - logp fe . (8) 

77 ^ ' 



7fe 

n 
i=i 

Note that f3H = (71, . . . , 7 n , 1), so we can compute \\j3H\\ = \/\\j\\ 2 + I. If we knew ||£f||, we could obtain j3 = (3(t) — 
\/!l7ll 2 + 1/11-^11 ( as we ^ as H itself) trivially. We will accomplish this below using simple symmetry and physical 
considerations. 

A dilation 1 1— > Ct (equivalently, too *- > Ctoo) leaves p invariant, so by (|8| it must also leave 7 invariant. Under the 
Ansatz (3 = /3(t), this dilation effects the transformation 

F =M (7 ' 1) "^) (7 ' 1) (9) 

which is itself a dilation by (3(t)/ (3(Ct). This is equivalent to the observation that p is constant on rays in both 
t- and _ff-coordinates, where tk/^Zjtj — Pk — e ^ £ '/Ej e_|3£j ' By the same token, under mild differentiability 

assumptions on the map between t and H we have that dt — d||t||er*' <J=^ dH = d\\H\\ei H \ where er denotes a unit 
radial vector. That is, rays and orthants of spheres in t-coordinates map to rays and hemispheres in ii-coordinates, 
and conversely (see figure [2]). 

The observations above show that \\H(t)\\ = ||F(||t||i)|| = l//?(||t||i), where 1 := (1, . . . , l)/y/rl. Moreover, 

m = mm ■ VhF+i- (10) 

Meanwhile, a multitude of physical considerations detailed in j3H[3S] (e.g., dimensional analysis, scaling for ideal gas 
systems, extended canonical transformations, etc.) require that ft scales as too, so that (up to an overall constant) 



m = iitiivW+T=tooiiPiiv4NF+^ (ii) 

where ||7|| 2 is computed using 

The case of a two-state system serves to illustrate the basic features of the bijection between t and H depicted in 
figure [2j A few lines of algebra yield 



<'T ( ^ ■ 1 



1/2 



Ei = -±-\og^; E 2 = -Ei. (12) 



In the other direction, we enforce Ej H> Ej — (Ei + Ei)j2, so that again E 2 = —Ei and 

cosh 27 



2 cosh 2 71 



1-1/2 

- (27? + 1) 



e" 71 _ e 71 

' e" 71 -I- e 7-1 ' 2 ~ 00 e-T 1 + e 71 ' 



With (11) and the Gibbs relation to obtain the energies, an experimenter has everything she needs in order to 
provide an effective description of the system in the idiom of equilibrium statistical physics. The principal difficulties 
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1 1 



FIG. 2: Geometry of the bijection t o H for two states. Contours of f3 1 = 1, 2 are shown in both coordinate systems. 



associated with the practical application of (11) are generally the identification of an appropriate coarse-grained 



partition of phase space and timescale. Even in the case of a single-spin system, where the partition is not an issue, 
and a proper choice of timescale exists, this choice is still not completely obvious (see appendix [C]) . Provided however 
that the partition and timescale chosen for an actual equilibrium physical system are appropriate, this effective 
framework will (re)capture the essential features of the usual statistical physics. More importantly, however, it allows 
equilibrium and steady-state (or even sufficiently slowly- varying) nonequilibrium systems to be treated on the same 
footing. 

The central theme of this paper is that given too of the sort mentioned in section [ITU there appears to be a uniquely 
determined choice of effective inverse temperature for Anosov systems based on physically reasonable considerations. 
The particulars bear close similarity with the well known variational principle relating the topological pressure and 
entropy of dynamical systems (see appendix |B|) . 



A SIMPLE MARKOV PARTITION FOR THE CAT MAP 



Let A £ ±SL(d,1t), i.e., A is an integral d x d matrix with determinant ±1. If A has no eigenvalues of modulus 
1, then A has both stable and unstable eigenspaces, AZ d — Z d and A determines an invertible map from the torus 
T d = M. d /Z d to itself. For these reasons such maps are called hyperbolic toral automorphisms; they provide the 
simplest examples of Anosov diffeomorphisms. 

The Arnol 'd-Avez cat map Ta is the two-dimensional hyperbolic toral automorphism defined by A = ( 2 \)- That 
is, Tax = Ax mod 1, where the modulus is taken componentwise. Henceforth we shall restrict ourselves to this choice 
of A unless stated otherwise. The relative simplicity of the cat map (or of more general two-dimensional hyperbolic 
toral automorphisms) provides fertile ground for exploring the effective statistical physics of Anosov systems. For 
example, the SRB measure is just (the pushforward of) Lebesgue measure, which also serves as a probability measure. 

With this in mind, we describe a Markov partition for the cat map originally constructed by Adler and Weiss. For 
< 9 < 7r/4 consider the partition 1Zq{6) of T 2 indicated in figure pi Writing c := cos 6*, s := sin#, e+ := (c, s)* and 
e_ := (s, — c)*, it is easy to see that the nontrivial vertices of both rectangles in 1Zq(9) are w\ = se + , = e2 + se_, 
and W4 = (c + s)e + — ex- Moreover, c 2 = and s 2 = §Ej*! where </> := is the golden ratio. 

Write A± = (f> ±2 . For A := cot" 1 (f> it is easy to verify that e+ and e_ are respectively the normalized unstable and 
stable eigenvectors corresponding to A; the eigenvalues are A±, and their logarithms are the Lyapunov exponents. 
Thus like TZq(0a), the rectangles in the partition Ta~R-o(0a) of T 2 have edges parallel to the x±-directions; more 
specifically, T A W\ ~ sA + e + , T^u^ ~ sA_e_, and T4W4 ~ (c + s)A + e + , where ~ indicates equivalence under the 
projection tt : M 2 — > T 2 . 

It follows that TZa = 7£q(#a) ^TaT^o(0 A ) is a Markov partition for Ta, where the join of partitions (i.e., the partition 
formed by intersections) is indicated by the symbol V. Indeed, elementary topological considerations along with the 
equalities A + s = c+ s and A_c — c ~ s suffice to establish that TZa is as shown in figure |4j In particular, the lengths 
of the resulting five rectangles in the contracting (x_) direction take on only two values: call the greater length a and 



the smaller length b. It follows that a = 2s — c and b = 2c — 3s, and from here all the points it* 
in figure [4] can be calculated straightforwardly. 

Note that if 1Z is a Markov partition for T then so is \]™_ 



and indicated 



/ T 3 TZ. Bearing this in mind, it is a straightforward 



10 




FIG. 3: A partition TZ o (0) of T 2 . The shaded angle 9 is in (0, tt/4). 



Markov partition for the cat map 




FIG. 4: TZa = 7?-o(#a) V Ta1Zo(0a) is a Markov partition for the cat map Ta- The constituent rectangles are labeled in order 
of their original encounter by starting at the origin and proceeding counterclockwise around the unit square, so that the first 
rectangle is the darkest and the fifth is the lightest. 

(if slightly tedious) combinatorial exercise to show that a partition of the form \J™__ m , T 3 a 1Za always contains tlq := 
F 2 ( m +m')+ii n i := 2F 2 ( m+m /) + 2, and n 2 := ^2(m+m')+3 rectangles of relative measure 1, 0, and 2 , respectively, where 
Fk = (4> k — (— (f>)~ k )/\/5 is the fcth Fibonacci number (see figurepl). The corresponding probabilities are 1/S, 




FIG. 5: (U A )m = VT=o t aK for (L) m = 0, (C) m = 1, and (R) m = 2. 
and </> 2 /S, where £ :— n + nitfi + n 2 (f> 2 . Now ||p|| 2 = (n Q + ni<f> 2 + n 2 4 )/£ 2 , and substitution yields 

,, ||2 = F2( m +m')+i + 2f 2(m+mO+2 2 + F 2(m+m >) +3 4 2(m+m,)+1 (l + 20 3 + 6 )/V5 
m {F 2 ( m+m >)+i + 2F 2{m+m , )+2 + F 2(m+m , )+3 2 ) 2 ~ 04 (m+m ')+2(l + 202 + 04 )2/5 ■ 
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Reducing powers of using <fi 2 = <fi + 1 leads to the asymptotic formula 

||p|| 2 „ 4 ^0-2(m+™') = . Ci4) -2( m+m >) (15) 

250 

with Ci fa 0.2211. 

Meanwhile, with n = m + n 2 + n 3 , we can see that nT 1 logp^- = (no/n) log(l/E) + (m/n) log(0/E) + 
(n 2 /n) log(0 2 /E) = (ni/n) log + (2n 2 /n) log — log E. When we form 7 fc = n _1 ^ . logp,, — logpfc the contributions 
from log E cancel, so that 7 has no entries equal to log 0- (n\ + 2n 2 )/n, m entries equal to log 0- (ni + 2n2)/n — log = 
log(/> • (n 2 — n )/n, and ri 2 entries equal to log</> • (rii + 2n 2 )/n — 21og0 = — log</> • (2n + ni)/n. Substituting for 
the rij gives that 7 has F 2{m+m , )+1 entries equal to 21og0 • F 2(m+m r )+i / F 2{m+m , )+b , 2F 2(m+m , }+2 entries equal to 
log0 • F 2 ( m+m ') + 2/F 2 ( m+m ') +5 , and F 2(m+m /) +3 entries equal to -21og0 • F 2{m+m , )+? ,/ F 2{m+m , )+b . 

It follows that 

(log \ 2 
~B ) ' (^2(m+m')+l ' 4 ^f(m+m')+4 + 2 J 7 2(m+m')+2 ' ^2(m+m')+2 + ^2(m+rn')+3 ' 4-F 2 ( m + TO ' )+3 ) ( 16 ) 

and using Fg ~ 0* /\/5 gives 

1 2 M 

|| 7 || 2 „ . (1 6 + l O )0 2 (™+™')-4 =; C2 02(m+m') (17) 

V5 

where C 2 w 0.5422. 

These approximations above are very good; note also that although the number of states increases exponentially, 
the density of spectra (i.e., not only the values of E but also their relative multiplicities) converges. 

With this in hand, we can quickly establish the behavior of /3/i<x, as a function of the class of Markov partitions 
induced by 1Z a- To a very good approximation we have that for m + m! » 1 

j- = IWI V¥f¥+1 * Ibll • INI= « 0.3463. (18) 

So the dependence of the effective temperature on the partition of the form described above is essentially nonexistent 
once its diameter is sufficiently small. As we shall see, this behavior holds more generally and yet is nontrivial. 



VI. LIMITING BEHAVIOR FOR P/t x UNDER REFINEMENTS OF THE MARKOV PARTITION BY 

ITS IMAGES 

Given a Markov partition 1Z for Ta , let z + and z_ = z^ denote the extents of the corresponding rectangles in the 
unstable and stable directions, respectively. By the Markov property, the number of (connected components of) 
rectangles in 1Z^ n :— Vjlo ^A^- that are contained within the jth rectangle of 1Z and have extent z_l in the stable 
direction is a well-defined integer. (Note that when considering SRB measures of rectangles in \J™__ m , T^TZ it suffices 
to set ml = by the T-invariance of ^srb- We exploit this here and henceforth for notational convenience more than 
anything else.) 

Let akj be the number of times that the interior of TaR/c crosses the interior of Rj in the stable direction-i.e., 
a is the so-called Markov matrix of TZ, and its spectrum consists of the absolute value of the spectrum of A along 
with zeros and roots of unity [35]. |140) Now B^ is the number of rectangles in TaTZ^ H TaRjc with stable extent 
X-Z^l = It follows that J2k a kjB^ gives the number of rectangles in TaTZ^ n Rj = 7Z^ n+1 H Rj with stable 

extent that is, we have £?( m+1 ) = a*B^ m \ 

The matrix B^ has a left inverse, and furthermore z_ := B^ m \z^ gives the corresponding extents of rectangles 
of lZ y m in the stable direction. This gives us everything we need in order to compute /3/ioo f° r this family of partitions. 
Because the SRB measure is Lebesgue measure, it admits a product decomposition along the stable and unstable 
directions. There are B^ rectangles of measure (j™^ := z +J zi"2, so 

bii 2 =E<Wi"; } ) 2 (19) 
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and 



/ 1 y 



(20) 



The useful aspect of this construction is that i?( m ) and zi'™" 1 can generally be computed without too much difficulty 
and only slight tedium. For Ha, we have by visual inspection that z + = c(0, <p, <p, 1, 1)* and that 



3 2\ fz- tl 
2 1 U-,2 



(21) 



so Z- i = 2s — c = b(j), where b := z_,2 = 2c — 3s. Moreover z_ = zf "* = b ■ (4>, 1, 0, 0, 1)* and again by inspection 



£>(0) 



Since (as visual inspection of figure [8] makes clear) 



/0 A 

1 1 

1 

1 

Vi o/ 



/l 1 1 0\ 
' 1 1 1 ' 
10 110 
10 1 
\0 1 1/ 



it is easy to check that for m > 



B {m) = a * B (m-i) = 



/ F2m ^2m+l\ 

-Fbm ^2m+l 
\-F2m-l -Pbm / 



/ 1 



4> 2m 



Vr 1 1/ 



where as usual Ft — (cjr — (—</>) )/v5 is the £th Fibonacci number. 
Now we note that the inverse of 

-F2m-1 -P2m / 



(22) 



(23) 



(24) 



(25) 



Fi m -\ —Fim 

Using this and the row redundancy of £?( m ' and z_ we can easily compute — £?(" l )\z_, obtaining 



Thus the matrix Q m > with entries £ 



(m) . 



;( m ) = cbcb- 2m 



/ 1 9 



Vi 0/ 



(26) 



(27) 



(28) 



13 



and 



\\pf=J2 B ^ (cir } ) : 
jk 



l2m 



V5 



(cb(f>- 2m ) 2 • (6 2 + 6 5 + 6 + 6 4 + <t> 2 + 5 + 1 + 3 + 0" 1 + 4?) 



(29) 



where the sum in parentheses on the RHS is in lexicographic order w/r/t matrix entries. Simplifying this using 
d> 2 = 6+1 leads to 



4r 2 h 2 

| 2 - ^ 7 t(50 + 3)0" 2m = C 1 0- 2m 
V5 



with d w 0.2211 as before. 
By the same token, we have 



(30) 



ii7ii 2 -E^ ( " 



X 



E 



V B [m) ^ X b Y 



R (m) Am) ( m ) 

log log 



Y 



for any scalars X, Y. The choices X — <fr 2m / \/5, Y = cb6 2m are most convenient and lead to 



D 



(m) 



,(m) 

^ log -^r- - log 6 • (1 + 20 + 6- 1 + 2 + 1 + 26 + + 6 + + 1) = log 6 ■ (60 + 4). 



3,k 



X 



Y 



Furthermore, Ylj k Bj™ /X = 50 + 3, so if we note that log( 



1) log 6 then 



Ml 



log 2 4 
V5 



(A 3 + 0A 4 + 1 A 3 + A 4 + A 3 + 0A 4 + A 2 + 0A 3 + 1 A 2 + A 3 ) 



where A £ := (26 - I) 2 = 4(1 - 1)6 + U-2. Now 



INI 



log 2 



V5 



(0[-40 + 8] + [26 + 2} [-86 + 13] + [26 + 1] [-120 + 20]) • 2m = 



(31) 



(32) 



(33) 



log 2 6 ■ 6 2m = C 2 2m (34) 



where as we have already seen C 2 ~ 0.5422. 



In summary, this reproduces the calculation of the previous section: fi/too ~ \JC\C 2 « 0.3463. 

Another Markov partition TZ' A for Ta is indicated in figure [6j The calculation for 1Z' A is carried out along entirely 



Markov partition for the cat map 




FIG. 6: Another Markov partition 1Z' A for Ta- For concreteness, we label the darkest rectangle R[, the next darkest rectangle 
R' 2 , the next darkest rectangle R3, and the white rectangle R' 4 . 

similar lines. We have z + = s(l, _1 , 1, cf>)* and z_ = s(l, 1, 1, -1 )*- Visual inspection of figure [9] shows that 



D 



(0) 



/0 IN 

1 

1 

Vl 0/ 



(35) 
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and 



so that for m > 



a = a(lZ' A ) = 



Flm F2m+1 




^2m 



/ 1 



We can compute zi™' 1 = B^ m \z- just as for 7£a 5 obtaining 



zi m) = s6- 2m - 1 



Thus the matrix C (m) is 



/ 1. 



„2j-2m-l 



(36) 



(37) 



(38) 



(39) 



and 



ibii^z^ (c^y 

which simplifies to 



(s 2 - 2m -l )2 . (1 + ^ + ^2 + + : + ^3 + ^ + 4) 



with C( fa 0.2733. 

Meanwhile, taking X = 2m /v / 5, Y = s 2 0~ 2m_1 leads to 



-2m 



/ i-2m 



5 



(m) 



V log - log0 • (0 + - 1 + + + + (T 1 + 2) = 30 log ( 



Y 



Similarly, £ B^/X = 30 + 4 + 0" 1 = 40 + 3. Writing , _ , 



3<p 



t, we obtain 



(40) 



(41) 



(42) 



hll 



y/5 



log 2 ■ (A' + + A'_! + 0A[, + A' + 0A' X + ^A'j + A' 2 ' 



Now 



hll 



vi 



log 2 0' 



30 



N + 3 1 - 7X^ 6( ^ + 30 + 4 ) = 



40 + 3 



log 2 280 + 15 
40 + 3 



=-.a 



(43) 



(44) 



with C' 2 ps 0.6593. 



So for the family of partitions induced by 7^ we have that ft /too ~ v/ C'JCj ps 0.4245 7^ 0.3463: the limiting values 
of P/too differ for the partition families induced by TZa and TZ' A . 

However, another initial partition gives the same result as for TZa- For the two-element Markov partition TZ A from 
example 1.4 of and depicted in figure 10 below, we have z + = (s, c)* and Z- — (s, c)* . From figure 10 we see that 



B <»> . ; ; 



(45) 



15 



and 



so that for m > 



o( m ) = f^m-l F 2m \ U- 1 1 



(47) 



We have 

>0 (B W r i 2 . = f ^ = s 0-- f 1) (48) 



so that 



*(m) _ 2 i-2ra 



(49) 



and 

bll 2 = £ B £° (Cj^) 2 ~ C^V 2 " 1 ) 2 • (r 1 + 2 + 2 + 5 ) = ^ • (20 + 1) 0~ 2m =: Crr 2m (50) 



with C(' « 0.5789. 

Continuing, with X = 2m /\/5, Y = s 2 (f)~ 2m we have 



£ 



J ' fe log - log0- (0 + 1 + 1 + 2$) = 20 2 log0 (51) 



X ° Y 

and 1 B if/ X = <t> + 2 + t' 1 = ^- Sincc ■ 2( t> 2 lQ g^ = 2 (<^ - 1) lo S< 



which simplifies to 

A,2m 



^ log 2 ■ (0" 1 [2(0 - l)] 2 + [2(0 - 1) - l] 2 + [2(0 - 1) - l] 2 + 0[2(0 - 1) - 2] 2 ) (52) 



„ „ _ log 2 • (0- 1 [-40 + 8] + [-80 + 13] + [-80 + 13] + 0[-120 + 20]) = ^= log 2 • 2m =: C 2 '0 2m (53) 

v 5 v 5 

with C 2 ' « 0.2071. 

Therefore for ft'^, we have - s/C'{C'i w 0.3463. Note that s/C'{C'i = s/ChCi ^ y/C[C' 2 . This suggests that 

differences between limiting values of /3/ioo among initial Markov partitions has a subtle origin. 

The key implication can be loosely put that while (3 /too may have a complex dependence on the "shape" of the 
Markov partition used, it does not depend substantially on the "scale" of that partition. This holds in greater 
generality, as we proceed to sketch below. 

For a generic two-dimensional hyperbolic toral automorphism, the rectangles in a Markov partition are not rect- 
angular in the Euclidean sense, but instead are (unions of connected) parallelograms since the defining matrix 
A e ±SL(2,Z) need not be symmetric. Nevertheless, the key elements of the scheme outlined above remain ef- 
fectively unchanged, provided that one first defines a defect angle as the difference between the angles of the two 
eigendirections less ir/2 and then multiplies by the cosine of the defect angle in the calculations for ||p|| 2 and 
||7|| 2 . Because this cosine contributes only a multiplicative constant for ||p|| 2 and does not contribute at all to ||7|| 2 , 
it does not affect the qualitative picture. 

In general, by linearity and the Markov property B^ will satisfy a linear recurrence relation with integral coef- 
ficients. For A = the recurrence relation is £?( m ) = ZB^^ 1 " 1 — B( m ~ 2 \ whereas for A = (f 2 ) the recurrence 
relation turns out to be B^ = AB {m -^ - B {m - 2 \ (Note that unlike the cat map, the second example does not have 
orthogonal stable and unstable directions.) The recurrence relation itself does not depend on the particular Markov 
partition, but only on the defining matrix A G ±SX(2,Z) and the initial condition. 
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A sketch of this is as follows. If A £ SX(2,Z) then its characteristic polynomial is Xa{v) — V 2 ~ TrA ■ y + 1. By 
the Cayley-Hamilton theorem, xa(A) — 0, so that powers of A satisfy a second-order linear recurrence relation with 
integral coefficients. Moreover, the spectral radius of a satisfies p(a) — p(A) for any Markov partition TZ by Theorem 
2.2 of 38J. It follows that 5< m+1 ) = tfB^ satisfies (up to the initial condition) the same recurrence relation as the 
powers of A, viz. B^ = TrA • SC" 1 " 1 ) - B (m - 2 \ 

More importantly for our purposes, and simply by virtue of the fact that B^ satisfies a linear recurrence relation, 
it has a well-defined scaling behavior that (by construction) is the inverse of that for z_ . Because ||p|| 2 scales as 

and ||7|| 2 scales as B^ m \ the two terms counterbalance each other and so /3/too converges to a finite nonzero 
value as to t 00. It follows similarly that the density of spectra (i.e., the effective energy values counting only relative 
multiplicities) converges to a finite set. 

The linear substructure of hyperbolic toral automorphisms appears to be the fundamental reason for this phe- 
nomenon. Although even in three dimensions Markov partitions for hyperbolic toral automorphisms are necessarily 
fractal |13) . it is reasonable to conjecture that some weaker form of this phenomenon should continue to hold in 
higher dimensions, since linearity is still preserved. Moreover, because every Anosov diffeomorphism of the torus T d 
is topologically conjugate to a hyperbolic toral automorphism and these maps play such a central role in the theory 
of Anosov diffeomorphisms, the significance of the above phenomenon should not be underestimated from the point 
of view of statistical physics in general, and of the chaotic hypothesis in particular. 

That said, and although there is a sense in which SRB measures for Anosov systems are almost product measures 
|39j . it is far from clear that the analogue of the recurrence relation here should necessarily enjoy the same linearity 
properties. Indeed, we shall see in section |XV| that the situation can be considerably more complicated in general, 
although consideration of physically reasonable refinements of Markov partitions still appear to yield unique limits 
for /3. 

Finally, while the results of this and the preceding section serve to motivate the relevance and applicability of 
effective statistical physics to the Ruelle program, they should not be misinterpreted as fundamental (apart from the 
asymptotic independence of /3/too w/r/t scale). Their import derives from the indirect evidence they provide both 



to support and to further constrain the functional form (11) of the effective temperature, and also for the impetus 



they provide for the results in later sections. Specifically, these results suggest that neither j3 nor too should have 
any explicit dependence on the number of states, which in turn lends additional credence to the already-mentioned 
identification of too with the mixing time for Anosov systems and motivates the approach taken in section [Xj 

VII. NONTRIVIALITY OF LIMITING BEHAVIOR 



The balancing act detailed above in sections |v| and VI is far from trivial. To see this, first let q £ (0, 1). Consider 



a family of partitions of the unit interval defined inductively by setting y^ := {[0,1]} and forming y( m+1 ) by 
subdividing the intervals in y( m ) into subintervals of relative (Lebesgue) measure q and 1 — q. 

It is not hard to see that the corresponding probability tuple p^ has (™) entries equal to q r (l — q) m ~ r , from which 
it follows that 



U m) \\ 2 = {q 2 + {l-q)T- (54) 



Similarly, 



2~ m logpf 5 = to Q log ^- + log(l - q)j , (55) 
where we have used J2 r (?) = 2 ™ and Er C") r = m2 m - 1 . Now 7M has (™) entries equal to (f - r) log j^, so 

ll7 (m) H 2 = V ^ • £ (™) (f -rY = log 2 • to2— (56) 

^ r=0 ^ 7 ' 1 

where we have also used J2 r ("r) 7 " 2 = m ( m + 1)2'™~ 2 . The net result is that /3/too cannot have a finite nonzero limit 
in this situation: indeed, /3/too tends to infinity as to increases unless q — 1/2, in which case /3/too tends to zero. 

Viewed another way, the existence of a nontrivial limit requires that ||7|| 2 = —n log 2 Z + ||logp|| 2 and ||p|| 2 have 
precisely inverse scaling behavior (note that logZ = — ^ logpj and that n has its own scaling behavior). Even if 
the entries of e Km • p are bounded by polynomials in to, this cannot be expected to hold in any generality. In this 
context the fact that /3/too converges in the manner detailed earlier can be seen to take on a special significance, 
particularly in light of the relevance of Anosov systems and Markov partitions for nonequilibrium steady states. 
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VIII. AN IMPLICATION FOR THE DETAILED FORM OF THE EFFECTIVE TEMPERATURE 



The derivation of the form for /3 actually leaves open the possibility that /3 = f(n) ■ ioolMI \/||7l| 2 + lj where 
11 = dimp. For p uniform this simplifies to /3 = f(n) ■ t oc /y / n, so it might therefore be imagined that an appropriate 
choice of / other than unity would be fin) = \fn. 

Another slightly more detailed argument in the same direction is as follows. Consider a Hamiltonian system 
with phase space measure T and such that T(E) := J H<E dT < oo. If the system is ergodic on constant-energy 
levels then an expression for the inverse temperature that is particularly well-suited for low-dimensional systems is 
Pb{E) — dE^ogT(E), and up to an additive constant the concomitant entropy is given by S — logr(-E) [40]. (In 
the usual expressions, the volume T(E) is replaced by the relative volume 9eI\) If now the system is mixing and 
a constant-energy level H = E is decomposed into cells of equal measure, then the inverse effective temperature 
simplifies to (3 = t^ii^ 1 / 2 , where n is the number of cells. This would seem to suggest that a factor n -1 / 2 should be 
absorbed into the expression for the effective temperature, so that would be proportional to the relative volume 
of the energy level, which is just c^r. In such an event we would have (3 = C8eT and (3 B = T(E)^ 1 dE^- 

Based on these considerations, scaling according to f(n) = yfn might seem to have the advantage of a well-defined 
microcanonical limit and a clear correspondence with established low-dimensional statistical physics, but as we have 
seen above that is actually not the case in at least one example where the SRB measure (which we recall generalizes 
the microcanonical measure) is uniform. Later we shall encounter another example in the geodesic flow on a surface 



of constant negative curvature: see section XVI 



Indeed, consider the continuous version of the formula for j3: for this to make sense it is necessary that hsrb be 
absolutely continuous w/r/t v 1 which amounts to saying that the probability density p of the SRB measure w/r/t v is 
well-defined and determined by the equality J x dfisRB = J x V dv for all v- measurable sets X. Although for hyperbolic 
toral automorphisms and geodesic flows this is the case (indeed, here p = 1), in general such a p does not exist: a 
SRB measure is generally not absolutely continuous w/r/t v (in fact absolute continuity of the conditional measure 
on unstable manifolds alone is an equivalent criterion for defining SRB measures |17|). For example, deterministic 
dissipative systems will typically contract phase space, so the concomitant SRB measure will necessarily be supported 
on a (typically dense) set of zero Riemannian measure [2], At the same time, the restriction of hsrb to a tr-algebra 
generated by any given set of finite microcells produces a well-defined density on microcells. This observation is 



related to the Ulam method (see section XIII ) . This makes it clear that in practice a discretization is always required 



on physical grounds for nonequilibrium steady states (and is usually invoked in equilibrium as well). 

So while considerations of the microcanonical ensemble might be taken to suggest that should be scaled so as 
to absorb a "missing" factor of y/n in a putative continuous limit, the detailed examination of Anosov systems as 
motivated by the chaotic hypothesis casts doubt on this idea. 

Finally, there is a preferred (characteristic) scale for a Markovian coarse graining: namely, just small enough in 
order for any physically interesting observables to be well-defined and effectively constant on rectangles. Meanwhile, a 
secondary fine-graining also arises naturally in the consideration of entropy production in nonequilibrium steady states; 
its characteristic scale determines the microscopic (local, quasi-ergodic) description of dynamics via a relationship 
between the mesoscopic coarse-graining scale in concert with the Lyapunov spectrum and a concomitant time scale 
[30j . These considerations provide further support to the idea that an effective inverse temperature should be (at 
least somewhat) insensitive to the scales of phase space discretizations. (See also section XIII ) 

As it happens, our own considerations naturally and felicitously lead to a class of microscopic Markov partitions as 
phase space discretizations that appear to obviate scaling issues of the sort discussed here and that furthermore also 
have physically reasonable uniformity properties. It is to this aspect of our framework that we now turn. 



IX. UNIFORMITY OF PARTITIONS 



The variation in the measures /J-SRsiRj) for Rj £ 1Z is responsible for the variation in coarse-grained state energies. 
However, any reasonable fine-graining of phase space should be approximately uniform w/r/t the Riemannian (or 
associated Liouville) measure. In the examples considered explicitly in this paper, [Asrb = v. This results in a degree 
of tension as one expects the dynamics to contribute to an effective temperature in some way other than through too 
even for these simple cases, a possibility that uniform SRB measures of rectangles would preclude. But as we shall 
see in the sequel and thereafter, this tension also suggests a physically reasonable method for coarse-graining that is 
at the heart of our results. 

Recall that the Anosov property is independent of the particular choice of Riemannian metric. By locally perturbing 
the metric inside of rectangles, the concomitant Riemannian measure can be made to satisfy v{Rj) = |41j . 
However, such a construction is far from natural, and is physically unmotivated. Moreover, the considerable evidence 
that "geodesically conjugate" Riemannian manifolds of negative curvature are isometric casts doubt on the existence 
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of an analogous approach for geodesic flows [HI 03]. More to the point, the Riemannian/Liouville measure associated 
with such a modified metric-which in the case of a surface of negative curvature is also the SRB measure-will not be 
invariant under the original geodesic flow [5J. 

It is therefore appropriate to briefly discuss the sense in which we may restrict consideration to a "natural" Rie- 
mannian measure. While not every Anosov system will preserve a natural Ricmannian measure, the prototypical ones 
will: for hyperbolic toral automorphisms, this is just the (pushforward of) Lebesgue measure, and for the geodesic flow 
on a surface of constant negative curvature it will be the Liouville measure. More generally, so-called conservative 
diffeomorphisms preserve a natural Riemannian measure (and diffeomorphisms in general preserve an equivalence 
class of measures) 44 . A wide class of conservative diffeomorphisms is furnished by Hamiltonian systems, and it 
is natural to couch the otherwise implicit notion of a "natural" Riemannian measure in this context (we will briefly 
touch upon the related context of Hamiltonian systems as they pertain to nonequilibrium statistical physics below) . 

Bearing this in mind, it appears unlikely that a Markov partition can typically be constructed such that (either) 
a "natural" Riemannian (or the SRB) measure is uniform on rectangles [T5]. [141] In the cases considered explicitly 
here, viz. the cat map and the (Poincare map for the) geodesic flow on a surface of constant negative curvature, the 
measures of rectangles will often be controlled in a weaker sense. Suppose that (i is a measure of maximal entropy 
(see, e.g., |45j for background), as is the case for the examples mentioned here. [142 Then the Kolmogorov- Sinai or 
metric entropy equals the topological entropy [143j : i.e. 

- wpWl £ M*)iogM*)-n (57) 

where as before we write TZ^ n := \f^LoT 3 TZ. It follows in this event that the ratios of terms of the form log//(i?) 
tend to unity. However, this only implies that the differences between measures of rectangles are subexponential, 
and suggests that the utility of entropy for analytically evaluating the uniformity of individual partitions is limited 
in practice. 

Basically, it appears unlikely that Markov partitions for Anosov systems can be generically constructed so that the 
Riemannian measures of rectangles are precisely uniform, though virtually by default the measures of rectangles will 
differ only by polynomial factors. This deficit, which might appear to be a mere nuisiance, in fact suggests an avenue 
whereby a significant conceptual link between effective and traditional equilibrium statistical physics can be forged. 



X. GREEDY REFINEMENTS OF MARKOV PARTITIONS 



Define TZ V T Rj ~ {Rk} k ^ U {T^ 1 (TRj H R k ) : a jk ^0}. We have the following 
Lemma. TZ V t Rj is Markov. 

Proof. See appendix [S] A heuristic sketch reduces to the definition of a Markov partition: namely, the images of 
rectangles in T(7\L VY Rj) will stretch across rectangles in TZVt Rj in the unstable direction, and rectangles in TZVt Rj 
will stretch across the images of rectangles in T(TZ\/t Rj) in the stable direction. □ 

Corollary. If S is a refinement of TZ and TZ VY Rj is a refinement of S, then S is Markov. In this event call S a 
quasi-minimal refinement of TZ. □ 

The generalized refinement TZV^ Rj ■— {Rk}k^j^{T^ t {T t Rjr\Rk)} fails to be Markov for t generic, since (e.g.) the 
analogue of iv) in the proof fails to hold. This obstruction precludes arbitrarily detailed control over the refinement of 
Markov partitions. However, the residual control offered by quasi-minimal refinements is still of considerable interest, 
as we proceed to illustrate. 

Call a quasi-minimal refinement minimal if it is not a nontrivial refinement of any nontrivial quasi-minimal refine- 
ment. Set TZq := TZ. For k > 0, let TZk+i be a minimal refinement of TZk of maximal ^-entropy. (An alternative 
extremal principle could turn out be more appropriate in general, e.g. minimizing the effective free energy.) These 
greedy refinements are asymptotically unique, in the sense that any two sequences of greedy refinements of an initial 
partition must share infinitely many elements. 

It can be shown that a sequence of greedy refinements of TZa contains a subsequence of refinements that are local 



maxima of the normalized entropy S U (TZ) / log\TZ\ (see, e.g., figure 17 for an example of this phenomenon in a more 
complicated setting). The rectangles in the elements TZA,m of this subsequence have relative measures 1 and <j) an d 
respective multiplicities L m+ \ and L m+ 2- Here the Lucas numbers are given by L m+ 2 — i m +i + L m with L\ = 1 and 
L 2 = 3, so that L m is the closest integer to (f> m . (See figures [7] and [8] Note also that the recurrence relations for the 
Fibonacci and Lucas numbers differ only in their initial conditions.) 
It follows that 

|b|| 2 = ,^" +2 f + £ "' + ! 2 « U- m ~ l (58) 
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Greedy refinements of R A 




FIG. 7: Markov partitions 1Za,i, TZa,2 and TZa,z obtained by the first three rounds of greedy refinements of TZa, indicated 
respectively by subsequently considering the solid, dashed and dotted line segments. 




FIG. 8: Depiction of greedy refinements in eigencoordinates of Ta- A portion of the unit (Cartesian) square and translates of 
the current partition are shown along with the action of Ta on the current partition. New boundary elements introduced by 
greedy partitioning are shown in bold. (L) 1Za,o- (R) TZa,i- 



and 



so that for TZa. 



| 7 || 2 = Lm+2L ™+1 + L »'+l^+2 lQg 2 M lQg 2 ^ (5Q) 



T 2 



P/toc ->■ J A log « 0.2393. (60) 
V 50 

Besides producing another family of partitions-by a different mechanism-for which /3/too converges to a finite value, 
this also results in the lowest asymptotic value seen thus far. This is to be expected because of the uniformity of the 
partitions so obtained, which serves not only to maximize the entropy but also to minimize ft /too- 

Remarkably, we obtain precisely the same results when starting with TZ' A instead. To see this, first consider the 
T^-A,m- TZa has five rectangles: one with relative measure 1, two with relative measure 0, and two with relative 
measure 2 . We summarize this in the obvious shorthand (1 x 1, x 2, 2 x 2)@iz A - The greedy refinement 7^.1 is 
obtained by dividing each of the two rectangles of relative measure 2 into two subrectangles of relative (w/r/t TZa ) 
measure 1 and 0, leading to (1 x 3,0 x 4)@n A i- To get TZa,2, each of the four rectangles of relative measure is 
further decomposed, and so on. (See figures [7] and [8j) Meanwhile, we have (1 x 1,0 x 3)@-r,^. Subdividing the three 
rectangles of relative measure produces TZ' A \, for which we have (1 x 3, x 4)@-r/ : see figure [9] Therefore we see 
not only how the Lucas numbers come about, but more importantly that the measures of greedy refinements exhibit 
a stabilization property leading to the equality of the limits for /3 /too- 

Moreover, the two-element Markov partition from example 1.4 of [38] (see figure |lO|) also yields the same result. 
For convenience we denote this partition by TZ" A and we start with (1 x 1,0 2 x l)mi" ■ The first greedy refinement 
leads to (1 x 2, x l)®^" ; the second greedy refinement leads to (1 x 1, x 3)@-r,» ; the third round of three greedy 
refinements leads to (1 x 3, x 4)@k^ 3 , and so on. 

In light of these equalities it is natural to conjecture that this limiting behavior is universal for the cat map (as 
well as for hyperbolic toral automorphisms in two dimensions and even more generally in higher dimensions and for 
constant-ceiling suspensions): i.e., that greedy refinements of any Markov partition TZ produce a (sub)sequence of 
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FIG. 9: (L) 1Z' A0 . (R) 1Z' A1 . Note that the result of a round of greedy refinements is not unique, though the value of the 
resulting entropy is. 



/ I 




FIG. 10: Upper left: TZ Afi . Upper right: Tl' AA . Lower left: TZ A ^ 2 . Lower right: K Ai3 . 



partitions whose measures eventually coincide with those obtained in the same way from any other Markov partition 
TV . We further conjecture that the limit inferior of /3/ioo over greedy refinements with diameter tending to zero is 
the physically preferred inverse effective temperature. 

To understand why this is plausible in the special case of Ta, suppose we have for an initial partition 1Z that 
(1 x c , x Cx,...(f) u x c u )@n- We expect that each of the rectangles of relative measure cj) v can be greedily refined 
into a rectangle of relative measure <f) v and 4> v ~ 2 ] by repeating this, we expect to eventually obtain a refinement 
W s.t. (1 x c ,cf) x ci)@n', with c = c + Y^n=i F m c m +i + F U -2C U and ci = c x + El^™^ + F u-ic u - (Note 
that this greedy refinement TZ' can itself be greedily refined.) So the idea is that the allowable initial relative measure 
multiplicities {c m } appear to be constrained by the Markov condition in such a way that greedy refinements always 
lead to a subsequence for which the relative measures are independent of the initial condition {c m }. 

Although this phenomenon does not appear to admit a straightforward generalization to arbitrary Anosov systems, 
supporting evidence in the context of the geodesic flow on a surface of constant negative curvature (see section XVI ) 
and the variational principle (see appendix [B]) do provide a reasonable basis to assume that its essence holds more 
generally. This also leads to a far-reaching conjecture about the effective statistical physics of nonequilibrium steady 
states couched in the context of SRB measures for Anosov systems. Before we summarize this proposal, however, we 
will briefly turn to the role of effective and actual ensembles. 



XI. ENSEMBLES 



The effective temperature leads to an effective canonical ensemble: in the case of Anosov systems, an effective state 
space is a Markovian coarse-graining of the underlying configuration space. However this underlying configuration 
space can also support an actual microcanonical ensemble (recall that the SRB measure generalizes the microcanonical 
ensemble), for example in the case of geodesic flow on a surface of constant negative curvature discussed in sections 
|XV| and |XVI| While the key to resolving this disconnect has already been described in section |III| the mechanisms 
by which effective and actual canonical ensembles can be brought into closer correspondence will be addressed here. 

In general, from the point of view of effective statistical physics no fundamental distinction is drawn between 
equilibrium and nonequilibrium steady states, provided that a suitable characteristic timescale is given. However, 
systems producing nonequilibrium steady states must necessarily be thermostatted in some way [351 05] . A proposal 
of Gallavotti [2] (see also, e.g., [BT, PT7H5T)] ) suggests that dynamics in the thermodynamical limit should be insensitive 
to the details of the thermostat (i.e., the concomitant SRB measures should tend to the same limit). By extension, 
this "ensemble equivalence" proposal holds that nonequilibrium ensembles enjoy a greater latitude in their definition 
than equilibrium ensembles. 
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A mechanism of particular interest to the case of Anosov systems is provided by the so-called Gaussian isokinetic 
(GIK) thermostat, which preserves the Anosov property. [144] The basic idea of the Gaussian thermostat for an 
autonomous ODE x = F(x) is to project F(x) onto the tangent space T X H of a specified constraint manifold S; 
thus Gaussian thermostats can be employed to fix not only the kinetic or internal energy, but also a field or current 
|32j . The GIK and Gaussian isoenergetic (GIE) thermostats have been shown to be formally equivalent in the 
thermodynamical limit [51 . Thus both the GIK and GIE correspond to the microcanonical ensemble (although 
spatial degrees of freedom are canonically distributed, in most cases of interest this is trivial [S2\). 

However, assuming the validity of the ensemble equivalence proposal and of the chaotic hypothesis, we may simul- 
taneously consider a non-Gaussian thermostat such as the Nose-Hoover (NH) thermostat [S^HMj and retain effective 
Anosov properties in the thermodynamical limit: the chaotic hypothesis has been shown to be applicable to both 
Gaussian and NH thermostats [BJ. Unlike the GIK or GIE thermostats, the NH thermostat corresponds to the canoni- 
cal ensemble in equilibrium, and maintains the internal energy in nonequilibrium. Both Gaussian and NH thermostats 
enjoy the seemingly contradictory properties of determinism, dissipativity, and time-reversibility; however, the NH 
thermostat can be regarded as a generalized Gaussian thermostat in certain contexts. Indeed, the NH thermostat itself 
admits generalization, providing considerable freedom for the construction of a concrete deterministic reservoir that 
yields the canonical distribution in many cases, of which the so-called Lorentz gas provides a particularly well-studied 
example (46] . 

While the dynamics of a thermostatted system appear to depend on the details of the thermostat, thermodynamical 
properties appear not to. That is, the ensemble equivalence proposal is supported by evidence, but may not be 
applicable to equivalence of chaotic properties (such as the oft-noted equality between phase space contraction and 
entropy production rates). In particular, transport properties can vary as a function of the thermostat (though this 
does not seem to be the case with Gaussian or NH thermostats). However, it is conceivable that such variability is 
due to the presence of only a small number of degrees of freedom, and that the ensemble equivalence proposal holds 
in full generality in the thermodynamical limit [46] , 




Even though Gaussian and NH thermostats can be realized in a Hamiltonian framework, in general projecting 
out thermostat/reservoir degrees of freedom will transform Hamiltonian dynamics into non-Hamiltonian dynamics 
|46j . Indeed, it has been argued that the study of nonequilibrium processes requires the consideration of either 
non-Hamiltonian or infinite systems [55] - 

Having discussed thermostats, we therefore briefly turn to the complementary topics of weak coupling and infinite 
systems. A natural system to consider the role of ensembles and the thermodynamical limit in the present context 
is a coupled lattice of Anosov diffeomorphisms, particularly as studied in [3D] [SB]- Aside from the broad range of 
physical applications of generic coupled map lattices [19] . coupled lattices of Anosov diffeomorphisms have recently 
been employed in concert with renormalization group analyses in attempts to understand the origins of diffusion from 
deterministic microdynamics with local conservation laws [57] , 

To tie these considerations into our framework in a more concrete manner through a relevant class of examples, 
write T := (T 2 ) 2 " and T N ■= (T 2 ) v «, where V = V N := {-N, . . . , N} d . The metric on T is given by a weighted 
sum of the metrics on each copy of T 2 , with the weights rapidly decreasing on 1 d . Let A : T — > T be given by 
(Ax)u = Ta%u for u) € % d . That is, A is the Cartesian product of copies of Ta indexed by elements of 7L d . Let An 
be the restriction of A to T/v- If 9 ■ T — > T 2 is well-behaved (e.g., analytic and depending only on tori in some finite 
neighborhood of the origin) and e small, we set (A s x) u :— Tax^ + eg(T- u x), where (t_ u x)( := X- u ^ . That is, A e 
is obtained from A by adding a weak Z d -invariant coupling. Considering periodic boundary conditions on T/v yields 
a concomitant map An,c- 

In [3B] [SB] it was shown that the SRB measure corresponding to A e exists and is well behaved for e sufficiently 
small, and that it equals the weak limit of the SRB measures of An,s- Moreover, if the coupling is not degenerate in 
a certain technical sense, then the projected SRB measures of An,e and A e are absolutely continuous w/r/t Lebesgue 
measure (which is the projection of the SRB measures of An and A onto T 2 ). In practical terms this means that 
the projected SRB measures of An, s and A e can be given in terms of probability densities w/r/t the projected SRB 
measures of An and A. 

Before concluding this section, we will briefly discuss the key technical observation at the root of these constructions. 
It is easily seen that the Cartesian product of Markov partitions is again a Markov partition for the associated product 
map, and as we shall sketch, a small perturbation of an Anosov diffcomorphism is again Anosov and has a Markov 
partition that can be simply characterized. 

A basic property of an Anosov diffcomorphism T is structural stability, i.e., the property that any T" sufficiently 
close to T in the natural metric is topologically conjugate to T. Let T" be a small smooth perturbation of T and U 
the corresponding topological conjugacy, viz. J7 _1 T'[/ = T. It turns out that if 1Z is a Markov partition for T then 




UWf T) {x,Rj) = Wf TI) {Ux,URj) 



(61) 



and similarly for the unstable manifolds. 
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To see this, let x' € WfrJx, Rj), i.e. x' e Rj and d(T n x,T n x') | 0. Then Ux' £ UR 3 and 
d{U- 1 T' n {Ux),U-' L T' n {Ux')) | 0. By continuity, d(T' n {Ux),T' n (Ux')) | 0. So Ux' G (Ete, f/i^). Now 

C/FT ( S T) (a;, = Wf TI) (Ux, URj) implies that 

T'UWf T) (x,Rj) = T'W^iUx^URj) c UWfa(U- l T'Ux,R k ) = Wj^T'L/x, J7iJ fc ). (62) 

As a consequence of this and the analogous result for unstable manifolds, UTZ is a Markov partition for T". 

In particular, and without delving into the issue of explicitly constructing the topological conjugacy if it is not 
already given (but see, e.g. [58 for general considerations along these lines and [20] for an explicit treatment in the 
case of a coupled cat map lattice), a Markov partition for T4 can therefore be used to obtain a Markov partition for 
A £ . Since as we have seen the "thermodynamical limit" of finite lattices of coupled cat maps is well-behaved, we can 
in principle consider the dynamics on a reduced space (such as Tn) from the point of view of the (actual) canonical 
ensemble. 

The details of an analogous construction for Anosov flows are presently unknown, though it is reasonable to 
anticipate that one should exist. The paradigmatic example of an Anosov flow (also considered later here) is the 
geodesic (Hamiltonian) flow on a surface of constant negative curvature. The primary obstacle is that a nonzero 
coupling destroys the invariance of the individual uncoupled Hamiltonians, thereby precluding uniform hyperbolicity 

[23155]. 

In summary, while the application of effective statistical physics to infinite systems in general requires care, con- 
siderations of both the thermodynamical limit and of projected thermostatted dynamics appear to be tenable in the 
present context. In the same spirit, while some care is clearly required in discussing the role of effective and actual 
ensembles, in the presence of a weak external coupling or thermostatting the conceptual distinction between effective 
and actual ensembles can reasonably be expected to range from subtle to nonexistent. 

On this basis our framework can be viewed as a proposal for a significant extension of the program initiated by 
Ruclle for a general theory of nonequilibrium statistical physics. We now turn to the explicit statement of this proposal 
before corroborating it in the other paradigmatic-and more physically relevant-example of an Anosov system. 



XII. THE PROPOSAL IN A NUTSHELL 



In the preceding sections we have couched the chaotic hypothesis in the context of effective statistical physics. On 
the basis of results obtained above (and corroborated in section XVI) for greedy refinements, we have outlined the 
key elements of a proposal for a comprehensive framework for nonequilibrium statistical physics that simultaneously 
incorporates and extends the formalism originally introduced by Ruelle and subsequently refined by Gallavotti and 
coworkers. Besides introducing several new ideas in this paper, we have also provided evidence to support a nascent 
theory of statistical physics that is truly intrinsic: i.e., that provides a general framework in which not only entropy 
but also concepts like (effective) temperature and energy can be understood simply in terms of raw temporal infor- 
mation about the dynamics, and without reference to a predetermined Hamiltonian (while bearing in mind that if a 
Hamiltonian exists, then it can be reconstructed using this framework). 

The proposal, in a nutshell, is that 



classical physical systems in either equilibrium or nonequilibrium steady states and with a well-defined 
characteristic timescale can be described using the idiom of equilibrium statistical physics. For many- 
particle systems the chaotic hypothesis is assumed to hold. For Anosov-like systems is the mixing or 
relaxation time, and there is a conjecturally unique and well-defined inverse effective temperature given by 
the infimum over initial Markov partitions of the limits inferior of the inverse effective temperatures over 
greedy refinements. This inverse effective temperature may be used to (re)construct an effective energy 
function by a suitable application of the Gibbs relation. A generalization of the variational principle (see 
appendix |B]) is conjectured to hold in which the effective free energy is minimized. 



Note that the proposed characterization of the preferred inverse effective temperature may be stronger than necessary, 
since for the cat map the initial Markov partitions that we have examined all lead to the same result. 

It is reasonable to speculate that a relationship effectively of the sort indicated by the following commutative 
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diagram might hold: 



TZ TZ' 




g(nvn') 



Here TZ and TV are arbitrary Markov partitions, the notation Q indicates greedy refinement, and the dashed arrows 
are the speculative part. If some greedy refinement Q(TZ\/ TV) of TZ V TV were also a greedy refinement of both TZ 
and TV , then the process of taking greedy refinements would be asymptotically unique. This in turn would provide a 
strong structural framework in which limits of the sort we have considered could be dealt with in great generality. 

Establishing a relationship along such lines would likely involve the space of all Markov partitions, and in particular 
the realization of this space as a contractible simplicial complex [BT)H6"3"] . Indeed, ordered triples of the form (TZ, TZ Vt 
Rj,TZ V T~ l TZ) correspond to certain triangles in this simplicial complex, as do more general triples in which the 
middle entry is replaced by the result of a suitably constrained sequence of minimal refinements. In this setting the 
aim is to construct two sequences of edges, each corresponding to greedy refinements of an initial Markov partition, 
that eventually coincide. 

Recent work aimed at characterizing associated spaces of Markov partitions for a generic two-dimensional hyperbolic 
toral automorphism [64 66 could serve as a basis for developing the necessary ideas in a concrete setting. This is of 
particular importance from the physical point of view since a complete theory of greedy or even minimal refinements 
might require the incorporation of abstract constructions from algebraic topology and algebraic K-theory [61] 163| 167] . 

A few points bear enumeration: 

• While the detailed results obtained here for prototypical Anosov diffeomorphisms (viz., two-dimensional hyper- 
bolic toral automorphisms and, as discussed in section |XVI| the geodesic flow on a surface of constant negative 
curvature) are not by themselves obviously fundamental and we do not presently have anything resembling a 
mathematical proof of the sort ultimately required, our analysis of Markov partitions in the light of the chaotic 
hypothesis provides strong motivation for the applicability of our framework for effective statistical physics 
to generic nonequilibrium steady states, incorporating and extending the Ruelle program. This analysis also 
suggests additional nontrivial constraints on the form of the effective temperature. 

• The relevant objects and quantities involved can all be obtained from dynamics alone, yet without external 
reference to a Hamiltonian. The existence of a dynamical basis for framing the effective temperature in relation 
to a broad class of systems of interest for statistical physics suggests its physical relevance [5J [5] • The important 
aspect of its application is to start with Markov partitions of M, and obtain after greedy refinements the 
concomitant effective temperatures, then to identify the appropriate extremal limiting effective temperature 
and its concomitant effective energy function, then select any Markov partition such that the effective energies 
can be treated as constant on its rectangles and apply our extension of the Ruelle framework. 

• The simultaneous use of greedy-and therefore more nearly regular-microscopic and mesoscopic Markov par- 
titions can be done self-consistently in the presence of a reservoir or thermostat, or in the thermodynamical 
limit. It turns out in fact that such considerations lead to estimates on the mixing time of an Anosov system, 



which is a physically natural choice for (see sections III and XIII I. This means that in the event that the 
system has a known Hamiltonian, the approach in this paper will (at least up to an overall timescale, if not 
manifestly) facilitate the recovery of traditional equilibrium statistical physics. It is for this reason that we can 
plausibly claim that our proposal suggests an extension of the Ruelle program to a potentially complete theory 
of statistical physics for nonequilibrium steady states. 



Now that the basic ideas have been sketched, we will briefly detour to discuss the Ulam method before continuing 
with continuous-time systems. 
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XIII. THE ULAM METHOD 



While coarse-graining Anosov systems via Markov partitions is entirely natural, a simultaneous fine-graining is 
both anticipated and necessary from the point of view of statistical physics [2] [30] ■ It is reasonable to expect that the 
SRB measure should be obtainable from a fairly generic fine-graining and that Markovity should be in some sense a 
technical convenience in this regard. The present section aims to illustrate how this is likely to be the case while also 
proposing a recipe for approximating the mixing time, which as we pointed out in section |Hl| is a physically natural 
choice for t^. 

In order to explain how this can be, some background is necessary. The Perron- Frobenius or transfer operator V 
(not to be confused with a partition) is given by J x V f — f T -i x / jll]. The SRB measure is given by an eigenvector 
of V with unit eigenvalue; the relaxation time (which can generally be expected to be related to the mixing time) 
is given by the second largest eigenvalue of V . The Ulam conjecture [55] is that the key spectral properties of the 
Perron- Frobenius operator can be obtained by a finite-dimensional approximation. Although open in its general form, 
the Ulam conjecture has been established for important special cases and supporting numerical work performed in 
others (a recent survey is [69]). 

The Ulam approximation (see also §17.5 of [T5J) is a so-called Galerkin method, and starts from a (typically 

non-Markov) partition S m — {s[ m \ . . . , si™**} and the Riemannian measure v. The matrix P^ ] [S] := v(Sj n 
T~ 1 Sk)/v{Sj) is clearly stochastic, and as such admits a corresponding invariant distribution n'™ 1 ' = T\( m )p( m ) . If 
one takes Markov partitions lZ m with the diameters of rectangles tending to zero, the corresponding sequence of 
probability measures P m converges weakly to the SRB measure for two-dimensional 

Anosov systems [70] . More generally for Markov partitions for two-dimensional Anosov systems, the isolated spectra 
of p( m ) approximate the isolated spectrum of the Perron-Frobenius operator for the projection of T onto the unstable 
boundary of 1Z. This result can be used to obtain good bounds on the mixing time, which we have identified as a 
natural choice for t^ [71] . 

Though a satisfactory explanation of the numerical effectiveness of the Ulam method does not exist at present [75] , 
it succeeds in considerable generality for approximating SRB measures [73Tl75| . It should be noted that the success 
of the Ulam method in approximating SRB measures does not entail success in approximating the corresponding 
Perron-Frobenius operator itself: indeed, for non-Markov partitions the Ulam approximation can fail to provide a 
reasonable approximation of the Perron-Frobenius operator of the cat map. However it has been argued that such 
failures are due to a lack of subunit isolated spectra of the Perron-Frobenius operator, and that such a situation 
should preclude the expectation of any reasonable spectral approximation in the first place [75] . 

In short, the Ulam method appears (but has not yet been proven) to provide precisely the tool required in order to 
dynamically obtain the SRB measure in a physically relevant way without reliance on Markov partitions, as well as 
for providing (bounds on) a suitable t^. 

This is especially noteworthy when one considers that not only is the construction of ((J<srb-) regular Markov 
partitions (or even a sequence of Markov partitions such that the Riemannian measures of rectangles tend to equality) 
unlikely, but more remarkably that the boundaries of Markov partitions are generically (and even in the case of 
hyperbolic toral automorphisms in more than two dimensions are always) fractal [13] . so that Markov partitions are 
intrinsically difficult to construct precisely. This latter fact suggests that in practice approximations of the SRB 
measure will generally be necessary in any event. 



XIV. THE CONTINUOUS CAT MAP AND THE CAT FLOW 



Having already discussed Anosov diffcomorphisms, we shall proceed to introduce Anosov flows. As a preliminary, 
we first briefly touch upon a continuous version of the cat map that informs the simplest Anosov flow. 

Iterates of Ta correspond to unit-frequency stroboscopic projections onto T 2 of trajectories on M 2 generated by the 
Hamiltonian H A (£,p) := K(p 2 - £ 2 + fp), where K = sinh _1 ( v / 5/2)/\/5. To see this, consider for the moment the 
more general Hamiltonian of the form H(£,p) = ^p 2 — y£ 2 + E£p. Now Hamilton's equations are x — Cx, where 
x = (|) and the Liouvillian is C = ( £ -e)- Writing L := \J E 2 + CD, we have that 

„ /cosh L + ^ sinh L ^sinhL \ ,„„. 
^ C =[ £ sm hZ coshi-fsinhij- (63) 

Now a bit of elementary algebra leads to log [ 2 \) — K { \ -i) = £a, so the corresponding Hamiltonian is Ha- 

The restriction of the associated Hamiltonian flow to [0, l] 2 is shown in figure 11 Projecting this restricted flow 



onto the torus results in a singular flow, called the naive cat flow. (Note that the projection of (0, 1/2) is a repeller.) 
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On the other hand, projecting the Hamiltonian flow on M 2 onto the torus does not result in a flow at all, but it 
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FIG. 11: The naive cat flow. 

does produce a continuous-time evolution x(t) = Tr(e tc x(0)) called the continuous cat map because of the equality 
x(e)=Tix(0). 

The cat map can be used to construct perhaps the simplest Anosov flow; the continuous cat map can help visualize 
this flow. Take the product T 2 x R and introduce the equivalence relation (Tax, z) <~ (x,z + l). Write M. := T 2 x K/ ~ 
and consider the flow on M. generated by the vector field e z . This "suspension" of the cat map is called the cat flow. 
It is an Anosov flow for M. [77J HE] with Anosov metric 

ds 2 = X 2 + Z dx 2 + + X 2 _ z dx 2 _ + dz 2 . (64) 

Given a Markov partition 1Z — . . . , R n } for the cat map and m > 3, consider the sets Rjk ■= Rj x {-^ — je}, 
where 1 < k < m and e < — . The section TV := {Rjk}j,k is readily seen to be a proper section for the cat flow [79] . 
The Poincare map for 1Z' sends Rjk to Rj.k+i for 1 < k < rn — 1, and because 1Z is a Markov partition for the cat 
map it follows that 1Z' is a Markov section for the cat flow. Theorems apply for Markov sections of Anosov flows that 
are similar to those for Markov partitions of Anosov diffeomorphisms |80) and as such the present situation may be 
analyzed in a manner essentially similar to that of previous sections. 

Indeed, for the characteristic times t[x) = (tx(x), . . . ,t n (x)) = t QO p(x) w/r/t 1Z of a finite segment of the orbit of 
x under the cat map Ta, the corresponding set of return times for the cat flow w/r/t the Markov section 1Z' tends 
to (t/m)® m as e I 0. (Here the m-fold concatenation of t/m is indicated: i.e., for s £ R", sf m := su r _u moc i „j + i 
for 1 < r < mn.) The inverse effective temperature corresponding to t is f3(t) — foo||p|| ■ VIItII 2 + 1; similarly, 
f3((t/m)® m ) = too||p|| • ^/m||7|| 2 + l/y/m. |145j Therefore (unless p is approximately uniform, in which case 7 ~ 0), 
P ( (t/m) ® m ) w (3(t). 

This shows that the effective temperature of the cat flow w/r/t the Markov section 1Z' is qualitatively similar to the 
effective temperature for the cat map w/r/t the Markov partition 1Z. The adaptation of results concerning hyperbolic 
toral automorphisms to the corresponding flows is straightforward. 

Unlike the cat map, however, the cat flow is not mixing due to the so-called Anosov alternative, which states that 
an Anosov flow is either a suspension of an Anosov diffeomorphism with constant ceiling or every strong stable and 
unstable manifold is everywhere dense, in which case it is mixing [14] . The cat flow is the prototypical example of 
such a suspension, and for this reason its utility as a model system for statistical physics is limited. Nevertheless, the 
cat flow is considered in magnetohydrodynamics as a kinematic fast dynamo, i.e., an MHD system with prescribed 
velocity field supporting the exponential amplification of the magnetic field for small magnetic diffusivity [78] . The 
geodesic flow on a surface of negative curvature is also a kinematic fast dynamo, but it also represents the other, more 
physically relevant possibility under the Anosov alternative: we deal with it next. 



XV. THE GEODESIC FLOW ON A SURFACE OF CONSTANT NEGATIVE CURVATURE 

In this section we introduce the other reasonably accessible example of an Anosov flow, and one that is mixing (and 
moreover, motivated the theory of Anosov flows): namely, the geodesic flow on a compact Riemannian manifold of 
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dimension > 2 with negative sectional curvature [5T] (see also [HI])- We then construct a map topologically conjugate 
to its Poincare map and a corresponding Markov partition. 

The physical interpretation of the geodesic flow is given by the free-particle Hamiltonian [83J 

H =-^T,9 ik {x)PiPk, (65) 

where as usual {g-* k ) is the inverse of the matrix (gjk) describing the Riemannian metric ds 2 — k 9jkdx J dx k in 
local coordinates. Although the construction of symbolic dynamics for such geodesic flows dates to Hadamard [84] 
and Morse [55], the fullest expression of the situation in the case of constant negative curvature is given in the 
complementary works of Bowen and Series [SB] and Adler and Flatto [57] ; we borrow liberally from the latter in the 
discussion below, which we have sought to make relatively self-contained at the expense of length. For a qualitative 
overview of the constructions involved, see [88j . 

Consider the Poincare disk model of two-dimensional hyperbolic space, i.e., the complex unit disk D endowed 
with the metric ds 2 = dzdz/(l — \z\ 2 ) 2 . In this model geometry, the geodesies are circular arcs intersecting the 
boundary S 1 of D at right angles. Useful coordinates on the unit tangent bundle are u = (£, T), s), where £ and r\ 
are respectively the terminal and initial ends of the geodesic determined by u, and where s is the hyperbolic distance 
from the midpoint of that geodesic to the base point of u in D. In this coordinate system the geodesic flow takes the 
form (£,77, s) -> {£,,1], s + t). 

We have for g > 2 and N = 8g — 4 the existence and (among others) the following properties of a bounded 
regular hyperbolic iV-gon F and of a corresponding discrete group T of Mobius transformations (i.e., maps of the 
form z M> (jj) ■ z :— ^rg , for ad — be ^ 0) that preserves D: 

• the translates tF for r G T tile D and have pairwise disjoint interiors, so that D/Y is well defined; 

• there is a unique element of Y mapping a specific pair of edges of F to each other, and gluing the paired edges 
of F produces a compact surface of genus g that can be identifed with D/Y; 

• the geodesies extending the boundary OF of F are contained within the translates rdF for r £ Y (this is called 
the "extension condition"). 

The Hamiltonian for the geodesic flow on D/Y takes the form (up to a constant) 

H = (1 - \z\ 2 ) 2 ■ PzPs = (1 - r 2 ) 2 • p 2 (66) 
where Euclidean coordinates and momenta on F are implicit in the RHS. 




FIG. 12: A surface with genus g = 2. For g > 2 there are surfaces of constant (or more generally global) negative curvature and 
whose geodesic flows are mixing (hence also ergodic). The construction of Adler and Flatto provides examples. For g = 2, the 
corresponding 12-gon can be recovered by cutting along the indicated paths. Incrementing g by adding a handle also requires 
adding two more geodesic loops; one of the new loops and one of the old loops are separated into two arcs by their intersections 
with neighboring loops, and cutting along these arcs yields eight new edges. This construction underlies the pairing of edges 
of F, which is indicated explicitly here for g — 2. 

An explicit description of the iV-gon F is 

N 

F = D \ (J ( v^T ■ D + ^e 27 ^'- 2 ^) (67) 
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with a = sec 2 S-. The edges inherit the implied index (in particular, we shall consider any indices deriving from edges 
of F to be taken modulo N) and the pairing of edges Sj turns out to be given in terms of 

= | (4, - i) mod AT, i0 dd 
I (2 — j) mod N, j even 

by the existence of unique elements Tj G T satisfying Tj(sj) — s~}.\, where sj 1 is the orientation reversal of Sj (see 



figure 12) 



The construction of a "rectilinear" map Tr that is topologically conjugate to the "curvilinear" Poincare map for 
the geodesic flow on D/T is tractable and will be detailed below. In particular, an explicit Markov partition for Tr 
will be constructed, and for this reason we shall restrict our attentions to this map. Given an edge Sj of dF, extend 
it to a geodesic. Write aj and a~j respectively for the initial and terminal ends of this geodesic. (NB. [ST] uses the 
notation aj and bj instead.) A line of algebra shows that the a ■ are given in terms of 



9 J ± :=^(i-2 9 )±cos- 1 (- A /^] ((H)) 



N 

(where the principal branch of the inverse cosine is assumed) by 

af = v^T • e w f + Va~e 2 ^- 2 ^/ N . (70) 

The sections for both the curvilinear and rectilinear Poincare maps are specified in terms of the , which occur in 
the (cyclic counterclockwise) order a^ , ,a 2 , . . . , ajj, ajv-i' Because Tj is a Mobius transformation and as such 
maps circles to circles, it is easy to show that 



T 
± j 



4 * <u) 



a U * <(,) + ! ( 71 ) 



if -i i ^ a , t\ 

3- 1 ct(j) 
a j + l ^ a vU)-l 



Let lj := (a,j , aj~) and Uj := (a^ } a^ ). It follows that Tj x Tj sends (for example) tj-\, ij, £j+i, Uj-it Uj, Uj+i 
respectively to t a (j)+i, u a (j), «o-Cj)+1) u o{j)-i- 

Write £ = e lx and r\ — e ly . Given two distinct points (£,17), (£',?/) m -S* 1 x S 1 , let ((£,27), (£',?/)) denote the path 
from (£, 77) to (£', 77') obtained by always keeping one argument fixed, first increasing the second argument ("up") and 
subsequently the second argument ("right"), i.e. 



(£, T]e lt ), < t < (y 1 - y) mod 2tt 

(£e 4i , 77'), (7/ — y) mod 2tt < t < (y' — y) mod 27r +r (x' — x) mod 2ir 



where the +r serves as an indication of the usual addition over K.. The domain R of the rectilinear map Tr is 
defined using the closed concatenation Ir of paths (£j,£j + i) and the closed concatenation Ur of paths (uj, Uj+i). 
Specifically, let R' be the region in S 1 x S 1 bounded from below by Ir and from above by ur. Define R := 
R' \ [t-R U {ui} U ■ ■ • U {un}) and let R* be the subset of pairs (£, 77) in R such that £ is in the counterclockwise arc 
from aj to aj +1 , with the latter point excluded. Refine the partition {R*, . . . ,R* N } of R by dividing each R* into 
two rectangles, with the division along a segment of constant first coordinate (note that there is only one way to do 
this) and denote the result by 1Z. One of the main results of [87] is that 1Z is a Markov partition for Tr (see figures 



13 and 14 1: we now turn to the construction of this map, eliding its derivation. 
It turns out that Tr is given by Tr(^,t]) :— (Tj(£),Tj(r])) for (£,77) S R*. We can construct Tj and hence Tr as 



follows. Because a Mobius transformation is determined by its action on (almost) any three points, using ( 71 ) allows 
the explicit construction of Tj. Specifically, recall that the Mobius transformation sending z±, z 2 , and z 3 respectively 
to 0, 1, and 00 is z t— > $j(zi, Z2, z 3 ) • z, where 

iKz!,^):^ 2 ^ 3 ! - Z ^- Z ^) (73) 
v ' ' ' \(z 2 -zi) -z 3 (z 2 -z 1 )J v ; 

and as usual the action of ( a c \ ) 6 GL(2, C) on z is given by ( " h d ) • z — ■ It follows that the Mobius transformation 
sending z\ , z 2 , and Z3 respectively to z[ , z' 2 , and z' 3 is 

z^r [Sj-^zi, z' 2 ,z 3 )F>(z l7 z 2 ,z 3 )} -z. (74) 
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Markov partition for T R 



Image under T R of the Markov partition forT R 





FIG. 13: (L) 1Z for g — 2. (R) TrTZ. Rectangles are indicated by consistent shading. 



Image under T R of the Markov partition for T R 



Image under T R of the Markov partition for T R 





FIG. 14: (L) TftlZ. (R) Tr1Z. Rectangles are indicated by consistent shading. Note that here the rectangles are not connected, 
though taking connected components yields refined Markov partitions. 



In particular, substituting appropriate points immediately yields Tj and completes the construction of Tr. 
For example, if g = 2 it can be shown (with some difficulty, or verified numerically with ease) that 

T 2g -iz = h- z, 



(75) 



where we exploit the fact that a constant multiple of a matrix in GL(2, C) gives rise to the same Mdbius transformation 
to write 



,271-1/12 



_ p -2tti/12 



(76) 



and here c :— (3/4) 1 / 4 . We thus obtain in this case the explicit formula 



rp IX _ IX 



ce~ ix — e 27 ™/ 12 

QfAx g— 27ri/12 



(77) 



This is graphed in figure |15[ 
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FIG. 15: The action of Tig-i on S 1 with g = 2 is highly nonlinear. The restriction of its domain and range as a building block 
of Tr on the first coordinate is indicated with dotted lines. The nonlinearity is also evidenced in the T^TZ shown in figure (14) 



XVI. SRB MEASURES OF MARKOV PARTITIONS FOR THE GEODESIC FLOW 

In this section we outline a numerical calculation of the inverse effective temperature along similar lines as for the 
cat map. Up to normalization, the SRB measure of a rectangle (whose points satisfy x > y) is given by 

""' / J dx dy . (x 2 ~y 2 \ , . (x 2 -yi\ . . (x l -y 2 \ . (x 1 -y 1 \ 

= log sin - log sin - log sin + log sin . (78) 
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e 
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Denoting this by n([x 1 ,x 2 ) x [yi,y 2 ]), we can see that fi([xi,x 2 ] x [yi,y 2 ]) = (i([xi + A,x 2 + A] x [y x + A,y 2 + A]). 
This partial translation invariance allows us to restrict our attention to subrectangles contained in two of the 2N 
rectangles in 1Z. The most convenient choice is the (2g — l)th pair, i.e., the two rectangles contained in R 2g -\ sharing 
an edge with £ = aj ' 2 an d that are always connected in the image of the usual map from the torus to [0, 2ir} 2 . Write 

Y := (a 2g , ajg+iJ • • • > a ~N> a i~i • • • i a tg-4i a 2g-2i a 2g-3' a 2g-i)- Now the ^-coordinates of the subrectangular boundaries 
are essentially determined by suitable translations of the coordinates obtained by iterates of T 2g _i on Y . 

This fact facilitates a numerical calculation of the measures of partition refinements along the same lines as the 
previous ones for hyperbolic toral automorphisms. In the simplest case we consider refinements of the form 72.^. The 
results of this calculation indicate that /3 /too diverges nearly exponentially as a function of the partition refinement. 
Although the vast growth in the number of rectangles precludes the use of refinements with more than a few iterates 
of T R , for Ta it suffices to consider only a few iterates in order to observe adequate numerical evidence of convergence, 



bolstering our view. The divergence can be ascribed to the nonlinearity of the Tj (see figure 15), which produces 



rectangles in partition refinements with widely varying measures (see figure 16 and recall section VII I. Proportionately 
fewer rectangles dominate the probabilities as m increases, and the rectangles with comparatively large measures cause 
||7|| to grow faster than ||p|| decays. 

Because Tr and Tq have the same invariant measure and the topological conjugacy between them turns out to be 
built from isometries, the phenomenon described here applies to Tc as well. 

However, considering greedy refinements substantially changes this picture. A greedy refinement of 1Z entails N — 1 
subsequent greedy refinements for the unaffected rectangle pairs R*, and this pattern persists for later stages of 
greedy refinement as well. But each individual greedy refinement amounts to a boundary cutting across the images 
of multiple pairs R* , so the resulting control over measures of rectangles is more significant than might otherwise be 
anticipated. 

In fact this point of view suggests that the cat map may actually be a sort of "worst case" for the controlling 
behavior of greedy refinements (at least for hyperbolic toral automorphisms in two dimensions) , since the images of 
rectangles so obtained will only map across two rectangles in the unstable direction. It is tempting to speculate that 
may be related to the well-known fact that <fi has the slowest converging continued fraction expansion of any irrational 
number. 

Numerics clearly suggest that the limit inferior of /3/too over greedy refinements is positive and finite: see figure 
The computations are greatly simplified by the trivial observation that nsRB{Rj H T~ 1 Rk) — Hsrb(TRj fl Rk)- 
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Combining this with the other convenient facts mentioned in this section and using T 2 „_\Z = h 1 • z facilitates the 
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measures ol rectangles for geodesic flow 



m=2 
m=3 



sorted and rescaled 



FIG. 16: Sorted measures of rectangles in successively refined partitions of the form 1Z^ } 




10* to" 
greedy refinements (units of N) 



FIG. 17: PI too for TZr with g = 2,3, 4 as a function of greedy refinements (in units of N). Note the logarithmic horizontal 
scale. This graph strongly suggests that liminf /3/too is both nonzero and finite, and also independent of g. Inset: Normalized 
entropy. Note that convergence to 1 is visible but slow. 

use of a simple data structure for the construction of greedy refinements and computation of the measures of their 
constituent rectangles in silico. 



XVII. COMPARISON WITH THE DYNAMICAL TEMPERATURE 



The continuous cat map and cat flow do not allow for the self-consistent application of the dynamical temper- 
ature introduced by Rugh [THl HI]. For a Hamiltonian system ergodic w/r/t the Liouville measure restricted to 
a nonsingular energy surface, the inverse dynamical temperatu re Pdm is given a.s. by the time average of (e.g.) 
<f> = V • (V-ff/|| ViJ|| 2 ). For the continuous cat map from section XIV it is easy to see that $ = — 4H/5Kx 4 , but the 



requirements for the dynamical temperature construction to apply are not met. This breakdown is due to the fact 
that the constant-energy curves of H are poorly behaved. In particular, the energy is only piecewise constant along 
trajectories, which are themselves nondifferentiable at points in n(d([0, l] 2 )) and repelling from points in 7r((0, 1/2)) 
(more generally, the cat map is not integrable). The additional fact that <&(x) > <^=> x 2 > {(f) — 1)^1 > where 
x is taken to be in [0, l] 2 C R 2 only serves as a further complication. It is also easy to see that <I> (with 

$ = V • (ViT/H ViJ|| 2 )) diverges to — oo logarithmically w/r/t e. Because the SRB measure for the cat map (or 
any hyperbolic toral automorphism) is just the pushforward of Lebesgue measure, it is clear that the Rugh formula 

Pdyn{E{x)) = lim t ^oo i" 1 J * (E>(x(t)) dr for the inverse dynamical temperature cannot be sensibly interpreted in this 
context. 
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In the case of Hamiltonian geodesic flow, experience suggests-and an explicit computation in the cases considered 
here shows-that <E> is inversely proportional to H, and so f3dyn is too. It follows that Pdyn and /3 have the same scaling 
behavior [33] provided that t x depends explicitly on the continuous evolution (as for example if t^ is taken to be 
the mixing time). Although j3 cannot be expected to recover the proper constant of proportionality ab initio, it can 
be so adjusted and in other respects it will automatically reproduce the key behavior of the classical and dynamical 
inverse temperatures. 

While the dynamical temperature has been evaluated in the context of a Henon-Heiles oscillator (HHO) [5Dj and 
a limited symbolic dynamical structure has also been identified for the HHO at the critical energy E — 1/6 [91] , the 
prospects for evaluating the effective temperature in this context are presently poor for a number of technical reasons. 

Nevertheless, the preceding considerations highlight the comparative domains of utility (versus applicability) for 
the effective and dynamical temperatures. If a (well-behaved) Hamiltonian system is given, then the dynamical 
temperature is usually appropriate (see, e.g. |92|). though the effective temperature will reproduce the same results 
(up to an overall choice of scale) provided it can be properly applied. If on the other hand a non-Hamiltonian 
dynamical evolution (such as is present in the canonical ensemble) or raw data in the form of symbolic dynamics are 
given, then the effective temperature is an appropriate generalization of the usual equilibrium temperature that also 
permits the identification of an effective energy function. 

It is particularly noteworthy that we have demonstrated the extraction of an effective temperature even in the 
microcanonical ensemble without any reliance on knowledge of the Hamiltonian beyond a constant energy surface, 
whereas the dynamical temperature requires this |89j . Also, because the derivation of the dynamical temperature rests 
on the relation (3dyn = dS/dE and there is no generally accepted physical definition of entropy for non-equilibrium 
steady states [3 1 1 146] , it is far from clear how or if the dynamical temperature can be generalized to the nonequilibrium 
case. 



XVIII. COMPARISON WITH THE FLUCTUATION-DISSIPATION EFFECTIVE TEMPERATURE 

The use of an effective temperature in glassy systems has a long history '.)'■'> 9~> and has recently gained prominence 
through the fluctuation- dissipation (FD) temperature in mean-field systems [UJ. In this section we discuss the nature 
of the FD temperature and its possible relationship with our effective temperature. 

Recall that the connected correlation function of an observable m is C(t,t w ) := (m(t)m(t w )) — (m(t)) (m(t w )} and 
the associated response and susceptibility are respectively given by 



Sh(t w ) 



and x(t,t w ):= / R(t,t') d£ ', (79) 

h=0 Jtu, 



where as usual the field h is conjugate to m and t w is the "waiting time" of the system at which h is first applied. 
In equilibrium the system is stationary and so we can write C{t,t w ) = C(t — t w ) and similarly for R and \. In this 
context the fluctuation-dissipation theorem (FDT) is a relation of the form 

d d 
- jz-x(t - t w ) = R(t - t w ) = (3 — C(t - t w ), (80) 

where to avoid confusion we write /3 Q to denote the true or "ordinary" inverse temperature. 

The FDT is typically violated in nonequilibrium; however for mean-field systems the nature of the FDT violations 
permits the definition of a timescale-dependent FD temperature that is observable-independent and agrees with a 
thermometer tuned to the appropriate timescale. Towards this end, the fluctuation-dissipation ratio X is defined via 

d d 
- ~Ki-X(t,t w ) = R(t,t w ) = l3 X(t,t w )—C(t,t w ). (81) 

The enabling result of justifying the definition of the FD temperature for mean-field systems is that if t and t w are large 
then X(t,t w ) R3 X(C(t,t w )) [TT]. In this context the inverse FD temperature is defined by /3FD{t 7 t w ) := f3 Q X(t,t w ). 
The functional dependence of the /3fd so defined is considered to be an asset in the context of glassy dynamics for 
mean-field systems, since the well-known separation of timescales in glasses 96 suggests the coexistence of multiple 
(effective) temperatures and the dependence on t w is expected due to nonstationarity or "aging" properties [97] [§5] . 

For systems other than mean-field glasses it has been found necessary to restrict consideration to an inverse FD 
temperature defined by Pfd,co '■— Aoo/3 , where X^ = linii^oo lim^oo X(t, t w ). This restriction emerged from the 
fact that /3pr}(t,t w ) exhibits dependence on observables for trap and one-dimensional Glauber-Ising models [355]. In 
light of this result the domain of applicability of the restricted FD temperature has been extensively studied, and 
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we provide a brief sketch of some representative results. Observable dependencies of Pfd.oo are still present in the 
one-dimensional Glauber-Ising model (although these dependencies disappear in two dimensions) and it is necessary 
in this case to restrict consideration to large spatial scales since at small scales the FDT still applies (100] . By the same 
token, while within the Gaussian approximation there is a well-defined for an O(N) model, beyond the Gaussian 
approximation this ceases to be true |101j . In high dimensions the quenched Glauber-Ising model also exhibits two 
distinct values of X^, depending on the initial magnetization, though there is no observable dependence for short- 
ranged interactions [102) . The Fredrickson- Anderson and East models [103 as well as the backgammon model [104] 
display negative values of X^, precluding an interpretation in terms of temperature. An exactly solvable lattice model 
in which energy is conserved also admits a definition of temperature along the usual lines (viz. f3 a = dS/dE) that differs 
from Pfd |105[ H06] . Finally, recent work indicates observable-dependence in a class of stationary nonequilibrium 
systems even in mean-field (but excluding any aging systems). In this event a signature of the breakdown is a 
difference between the Shannon entropies of the nonequilibrium state and of the equilibrium state with the same 
average energy (1071 1108] , While it has been suggested that at least some defects in applicability for /3fd might be 
remedied by considering nonlinear generalizations of the FDT |109H111] , it is nevertheless apparent that the domain of 
applicability of Pfd as an effective temperature is limited |112j . even as its interpretation remains to be fully realized. 

Although /3fd carries a clear physical significance and interpretation for mean-field glasses, these systems are 
fundamentally ill-suited for direct characterization via a single inverse effective temperature ft. On a basic level this 
can be ascribed to nonstationarity and separation of timescales, mandating consideration of a parametrized family of 
inverse effective temperatures. Non-mean-field glasses exhibit not only these features but also spatial and temporal 
dynamical heterogeneity, which serve as additional complications [1131 1114] . Moreover, it has been observed that 
the long-time dynamics of spin glass models beyond mean-field are invariant under generic time reparametrization 
(and a weaker class of reparametrization still holds for critical coarsening systems |115j ). which further indicates the 
difficulties in applying our framework directly to glassy dynamics in general (1 161 1117] . 

Finally, it is not at all obvious how to obtain a description of the dynamics of such systems that is appropriate 
for our framework. It is worth explicitly recalling the main theme of this paper: namely that even in the case of 
Anosov systems, where as we have seen a natural class of discretizations is provided, it still appears necessary (but 
also sufficient) to put in significant effort in order to obtain a physically reasonable value of the effective temperature. 

This is not to say that Pfd and /3 cannot be related, but only that the existence or nature of any such relationship 
is not clear. The existence of such a relationship seems plausible since for an oscillator coupled to a nonequilibrium 
environment, accounting for a frequency-dependent Pfd m the definition of entropy production rate causes the 
(Gallavotti-Cohen) fluctuation theorem to hold [112] . Meanwhile based on the evidence in the present paper it is not 
unreasonable to speculate that a parametrized family of inverse effective temperatures /3 might be appropriate for 
systems with multiple timescales, and that an appropriate coarse-graining (or suitable measure on dynamical modes, 
etc.) can in principle be found. In a similar vein we mention the possibility of applying superstatistics for systems 
with multiple timescales |118H120] . 

Finally, we note that it is possible to take to be (very nearly) inversely proportional to the largest energy scales 
for a single Glauber-Ising spin (at high temperatures, this is dictated by the environment; at low temperatures, by the 
product of the magnetic moment and applied field) in such a way that (3 = [3fd = fio- This is detailed in appendix 
[C| Because mixing and relaxation times are both broadly interchangeable with inverse spectral gaps, such a choice of 
ioo is consistent with and provides additional support for the particulars of our approach to Anosov systems. 

We conclude this section by mentioning that a coarse-graining scheme for glass-forming systems in which quasi- 
species are defined via the average local coordination statistics of particles superficially appears to indicate a possible 
avenue for applying our effective temperature to glasses in an altogether different manner. In this scheme the relative 
concentration of quasi-species are determined by a generalized Gibbs distribution and a relaxation time naturally enters 
the phenomenological framework |121j . However, this relaxation time scales superexponentially (versus linearly) with 
Po and so using it for too would eventually lead to (3 Q < (3, while an effective temperature for glasses should satisfy 
the reverse inequality. Indeed, an effective temperature appropriate to this scheme has already been developed |122j . 
It would be of interest to see if the timescale obtained by inverting our framework while forcing /3 _1 to equal this 
particular effective temperature turns out to carry physical significance. 

XIX. CONCLUSION 

The relevance of the cat map and the geodesic flow on a surface of constant negative curvature for statistical 
physics derives from their privileged status as the prototypical Anosov diffeomorphism and flow, respectively. The 
Markovian symbolic dynamics exhibited by Anosov systems not only highlights their chaotic properties but also their 
correspondence with spin systems. Moreoever, the chaotic hypothesis maintains that Anosov systems are essential to 
the dynamics of nonequilibrium steady states. 
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While of course Markov structures (i.e., partitions or sections) are not unique, it is nevertheless evident that 
phenomena which hold for any Markov structure of an Anosov system are likely to be of relevance to statistical 
physics. One remarkable phenomenon in this vein is the observed limiting behavior of as calculated on greedy 

partitions for both two-dimensional hyperbolic toral automorphisms and the geodesic flow. While any proof of 
these phenomena would seem to require the development of new and nontrivial mathematics, the observed behavior 
nevertheless appears to be generic. 

Of particular importance is the implications that the evidence of this limiting behavior has for a proposed general 
theory of nonequilibrium steady states. As we have argued in this paper, it is plausible to extend the chaotic 
hypothesis with a conjectured generalization of the variational principle in which a suitable minimization of the 
effective free energy uniquely determines the inverse effective temperature. Moreover, such results justify a proposal 
for a comprehensive framework for nonequilibrium statistical physics that simultaneously incorporates and extends the 
formalism originally introduced by Ruelle and subsequently refined by Gallavotti, Cohen and others. The framework 
has as its goal a broad theory of nonequilibrium statistical physics that is truly intrinsic: i.e., that provides information 
about physical observables simply in terms of raw temporal information about the dynamics. 

One reason to consider a proposal of the sort described here, in which the concept of (effective) temperature plays 
such a central role, is because there is no generally accepted physical definition of entropy for non-equilibrium steady 
states. It is hoped that the present work will serve to elicit fruitful investigations into the fundamental nature of 
stationary physical systems far from equilibrium. 

Appendix A: Proof of the lemma from section [X| 

W/l/o/g, let n = \TZ\: it suffices to show that TZ' := TZ V T R n is Markov. Let {£ u . . . ,t L } := {£ : a{lZ) nl ^ 0} and 
(again w/l/o/g) set R'j := Rj for 1 < j < n, and i?; i+r _ 1 := T~ 1 (TR n n R lr ) = R n n T~ 1 R lr for 1 < r < L. For 
the proof we may disregard boundaries of rectangles and we shall also employ the convention that 1 < k < n and 
1 < s < L. In the event that x £ Rj fl T~ l Rk for some j and k the Markov conditions are trivially satisfied. There 
are three remaining cases which we shall address individually. 

Case 1: x £ Rj n T~ 1 R n+s _ 1 for some j, s. We must show that i) TW s (x, Rj) C W s (Tx, R' n+S _ 1 ) and ii) 
W u {Tx,R' n+s _ 1 ) C TW u (x,Rj). To see i), note first that W s (Tx, R' n+s -i) = W s (Tx,R n ) n T^Rt.. Since TZ 
is Markov, we have that TW s (x,Rj) C W s {Tx,R„), so for i) we need only show that TW s (x,Rj) C T^R^. 
Since T^TZ is Markov, we have that TW s {x, T~ 1 R n ) C W(Tx, T~ 1 R e J C T~ 1 R l , s , so we need only show that 
W s (x, Rj) C W s (x, T~ 1 R n ). But this follows directly since TZ is Markov, establishing i). To see ii), note that 
W^TZj-^+s-i) = W u (Tx,R n ) nr 1 ^ and W u (Tx,R n ) C TW u (x,Rj) since TZ is Markov. This establishes ii) 
and hence Case 1. 

Case 2: x £ R' n+r _ 1 H T~ 1 Rk for some r, k. In this event, since 1Z is a partition, it follows that l r — k. So we 
must show that iii) TW S (x, R' nJrr _ x ) C W s (Tx, R lr ) and iv) W u (Tx, R lr ) C TW u {x,R' n+r _ l ). To see iii), note that 
TW a (a;,i4+r-i) = TW s {x,R n )n R tr . Since U is Markov, TW s (x,R n ) C W s (Tx, R ir ), from which iii) follows. To 
see iv), note that TW u {x,R' n+r _ 1 ) = TW u (x,R n ) n R ir , and since TZ is Markov W u (Tx,R ir ) C TW u (x,R n ). This 
establishes iv) and hence Case 2. 

Case 3: x £ i?; i+r _ 1 n T~ 1 R' n+s _ l for some r, s. We must show that v) TW s {x, K +r _i) C W s (Tx, i?; +s _i) 
and vi) W u (Tx,R' n+s _ 1 ) C TW u {x,R' n+r _ l ). To see v), note that TW s (x, R^+r-i) = TW s (x, T~ 1 Ri r ) n TR n and 
W^Tx.R'^) = W^Tx^^R^) n R n . Since T~ lr R is Markov and x £ T~ r R tr n T~ 1 (T~ 1 Rg s ), we have that 
TW a (x,T-^Rt r ) C W s {Tx,T- l R ta ), so we need only show that W s (Tx,T~ 1 R.£ s ) C\TR n C W s (Tx,T~ 1 R e J n R n . 
But this follows since TZ is Markov and Tx £ TR n n Rn. 

To see vi), note that W u (Tx, = W u {Tx,T~ 1 Ri a ) n Rn and TW u {x, R n+r -i) — TW u (x, T~ 1 R ir ) n TR n . 

Since T~ X TZ is Markov and x £ T^R^ n T~ 1 (T~ 1 R es ), we have that W u (Tx,T~ 1 R is ) C TW u {x,T~ 1 R lr ). Since 
x £ R n n T~ x Rn and TZ is Markov, W^{Tx,R n ) C TTV u (a;, R n ). It follows that W"(Tx, T~ l Ri s ) n ^"(Ts, i?„) C 
TVy u (x, T~ 1 R( r ) n TVy u (x, i?„), from which vi), Case 3 and the lemma follow in turn. □ 

Appendix B: Variational principle 

A subset E C M is called (k, e)-separated iff for every x ^ y there exists j € {0, . . . , k — 1} s.t. d(T^x, T^y) > e, 
in which case we write E 1 = -E(T, fc, e). If T is hyperbolic or expanding, a natural way to try to build maximal (fc, e)- 
separated sets is to consider points spaced at distances ~ e on a local unstable manifold. We will use this intuition 
to provide a self-contained heuristic sketch of the so-called variational principle as it applies to Anosov systems. For 
a more detailed and rigorous discussion that is also more generally applicable, see |12j . 
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For nice define 0(e), := YljZi 4>{T^x) and consider z(T,<p,E) := J2xee ex P[0(E),feO&)]- Write z(T,<f>,k,e) :— 
s\ip E i Ttk<e \z(T,4>,E(T,k,E)), and let E*(T,k,e) be a saturating set for which the supremum is attained. Un- 
der the reasonable assumption that E*(T,k,e) can effectively be given by the construction described above, then 
z(T,(j>,E*(T,k,e)) = z(T,(f>,k,e) should be approximately equal to \E*(T,k,e)\ ■ exp [k f M (j> d/j,*], where //* is a 
T-invariant probability measure on M that has nice regularity properties on unstable manifolds. 

We consequently expect that 

i log «(T,0, fc,e)« i log|£r(T, fc,e)|+ / d/x*. (Bl) 
It can be shown that the topological entropy 



M 



h(T) :=]imh(e,T) = lim lim sup ^ log [ sup |£(T, jfe, e)| ] (B2) 



is well-defined |146j and (Bl) and (B2| combine to illustrate why the topological pressure P(T, 0) (which is really a 
generalized free energy density) satisfies 

P(T,0) := lim lim sup \ log z{T,<p,k,e) = h(T) + [ dfu, (B3) 

for a suitable T-invariant probability measure /i*, called an "equilibrium state" for w/r/t T (note that in the present 
context an equilibrium state can actually describe a physical nonequilibrium steady state). As we have just sketched, 
SRB measures-which can be defined as precisely the T-invariant measures with the strongest regularity properties on 
unstable manifolds-are equilibrium states. 
Finally, it can be shown that 



P(T,<f>)>h M (T) + / 0<^ (B4) 

JM 

for a generic T-invariant Borel probability measure /i, where h^(T) denotes the metric or Kolmogorov- Sinai entropy 
sup^y limfc^oo k^ 1 S ^(V^ Z^T^ 3 X), and where S^(X) := — J2xex log p{X) is the (Shannon) entropy of X w/r/t 



/i. The equations (B3| and (B4) comprise the variational principle. This principle highlights not only the relationship 
between topological and metric entropies, but also characterizes equilibrium states or (equivalently for our purposes, by 
the symbolic dynamics correspondence) Gibbs measures uniquely determined by well-behaved potentials on subshifts 
of finite type-i.e., by physically reasonable interactions on one-dimensional spin systems. 

Appendix C: A single Glauber-Ising spin 

Consider a single Glauber-Ising spin a. Its dynamics are determined by an overall inverse timescale a and b := 
tanh(/3 Q ///i), where /i is the magnetic moment and h is the magnetic field |123j . The conjugate to a is — fih and the 
dynamics corresponding to a = (—1, 1)* are given by the stochastic generator matrix 

Q -1 J?\) ( C1 ) 



2 V 1 - 6 -(!-&) 

with corresponding invariant distribution p — |(1 — b, 1 + 6). 

In general b and Q inherit any functional time-dependence of h, requiring that we consider the Markov propagator 
U(t w ,t) := C/" 1 (£„)[/(*), where 

U{t) := O* exp f Q(s) ds, (C2) 
Jo 

and O* indicates the formal adjoint or reverse ordering operator. Thus (e.g.) the distribution p(s) is propagated as 
pit) = p(t w )U(t w ,t). In the situation of interest however h merely changes from one constant value h on (— 00, t w ) 
to another constant value h + Sh on [t w , 00), so U(t w , t) is just a matrix exponential. A routine calculation using the 
identities (f{t)f{t w )} = J2 3 ,kPj( t ^)fj u jk(tw,t)f k , (f(t w )) = EjPj(^)/j and (/(*)) = T l j,kPji t ^) U jk(tw,t)fk for 
time-independent / leads to 

(a(t)a(t w )) - {a(t))(a(t w )) = C(t - t w ) = (1 - b 2 )e~ •<*-*->. (C3) 



35 



Moreover, 



(a(t)} = (a(t w )) + 



b{t') dt' 



(C4) 



and in the present context this simplifies to (cr(t)} = (b + 5b) — 5b ■ e~ a ^~ tw \ Now db/d(—(ih)\o = —f3 and so 
5(a(t))/5(—fj,h(t w ))\o = /3 e _0 ( t_t ™). When taking partial derivatives w/r/t t w or t it is appropriate to set a = 1 
(or to rescale the time variables by a). A line of algebra now shows that the equilibrium FDT holds for any h, not 
just h = 0, i.e. = j3 a . This is in some sense a peculiarity of the single-spin system: there is only one nontrivial 
observable, so any number of conjugate fields can be amalgamated. 
Meanwhile one has ||p|| 2 = (1 + b 2 )/2 and ||7|| 2 = 2{p o ^h) 2 , so that 



/3 2 = C' 1: ^ 2 (2[/3 ^] 2 + l), 



(C5) 



and enforcing j3 — /3 Q leads to 



too 



(l + 6 2 )(2[/3 ^] 2 + l) 



1/2 



(C6) 



The question now is whether or not this is a physically reasonable characteristic timescale. We have that « V2(3 
for j3 a small and ~ 1/ for (3 a large. In both regimes is asymptotically inversely proportional to the largest 
energy scale, and indeed by equipartition the constant of proportionality can be chosen to be the same in both cases. 
On this basis can be regarded as a physically reasonable characteristic timescale similar to a relaxation (or mixing) 
time. [147] It would be of interest to determine if a relationship of this sort applies to other systems. 
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Let T : M — > M be a diffeomorphism and let /i be an T-invariant measure. By Oseledets' multiplicative er- 
godic theorem (see, e.g., |45|). the tangent space decomposes as T X M — ©i =1 T^ , where the Lyapunov exponents 
Afc := limn-t-oo n _1 log||D I T n u ( - fc ' || are well defined /i-almost everywhere for £ Tx k \ For an Anosov diffeomor- 

phism the Lyapunov exponents are constant and the hyperbolic structure is manifested as — (B fc . Afc<0 T'j'*' an d 

/T™ — iXi rp(k) 
— Wk:X k >0 ±x ■ 

[135] In general, for a C 2 diffeomorphism T with at least one Lyapunov exponent that is a.e. positive w/r/t v, a SRB measure 
is a T-invariant measure whose conditional measures on unstable manifolds are absolutely continuous w/r/t the relative 
Riemannian volume. 

[136] Two maps / : X — > X and g : Y — > Y are called topologically conjugate if there is some h : X — > Y such that / = hogoh^ 1 . 
In this case, f n ~hog n o h~ 1 as well. Intuitively, the notion of topological conjugacy can be regarded as equivalence 
w/r/t a change of coordinates. 

[137] Another more exotic reason one might expect Markov partitions to have physical relevance, especially in concert with a 
microscopic partition X, is because of the relationship between periodic orbits (which correspond to periodic sequences 
of the form {TZ{T k x )} for any 7?.) and the density of states in semiclassical quantum theory [88] . 
[138] Let T be an ergodic transformation of a probability space (M, fi,P) endowed with a finite partition 1Z = {Ri, . . . ,Rn}, 
are a.s. disjoint and have positive probability. For x € Rk define r(x) := inf{£ > : T e x € Rk}- The Kac 
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lemma (see, e.g. |1241 1125] ) gives that f R t(x) dV(x) — 1. Now j x t(x) d¥(x) — J^ fe J R ^ t(x) dP(x) = n, or equivalently 
Er = n. In words, the average first return time is simply the number of elements of the partition. 

[139] An analogous mixing time also exists for Markov processes. Define the total variation distance between two probability 
measures P and P' as ||P-P']| TV := sup x |P(A)-P'(X)| = § £JP(x) - P'(z)|, where the last equality can be taken as a 
lemma. If P is the transition matrix for a Markov chain with stationary distribution p and 5 > 0, set d(i) := sup,, \\P e (j, •) — 
p\\tv- The standard convergence proof for Markov chains shows that d(£) is bounded by a decaying exponential, which 
motivates the definition of mixing time as t m ix(S) := inf {£ : d(£) < 5}. The relaxation time is given by the inverse 
absolute spectral gap, viz. t re i := (1 — A,) -1 , where A» = sup{|A| : A G spec(P) AA / 1}. If P is reversible, irreducible, 
and aperiodic, then the two timescales are closely related: viz. (t re i — 1) • Iog(l/2<5) < t m ix(S) < trei • log(l/<5infj pj) [126] . 
For the Ising model with Glauber dynamics, both t m ix '■= t m i X (l/4) > t m i x (5)/ [log 2 <5 _1 ] and t rB i depend on the inverse 
temperature, and in fact these timescales can exhibit sharp transitions depending on the temperature. In general however 
one expects the temperature dependence to be monotonic as a function of a characteristic timescale; indeed, the effective 
inverse temperature f3 and associated timescale ioo scale linearly with each other [34] . 

[140] Note that by taking suitable partition refinements we can force the Markov and transition matrices to coincide. 

[141] Although in general Markov partitions will not have such strong regularity properties, the so-called baker map (which is 
not Anosov) has SRB measure equal to Lebesgue measure on the unit square and admits Markov partitions consisting of 
regular grids of squares [127] . 

[142] This line of reasoning actually provides a demonstration of the well-known fact that the SRB measure for a hyperbolic 
toral automorphism (i.e., the pu shfo rward of Lebesgue measure) is a measure of maximal entropy. By the same scaling 
argument as employed in section VI we have that log(«+0^ m '*) = — Cml + o(m), where C is a positive constant and 1 

(here) denotes a matrix of all ones. Since J]\- k z+^z_^ = 1, we have that JZj k ^jk^ z +^ z -k l°s( 2; +j' 2 -"fc) — ^ 1> 
and moreover that C equals both the Kolmogorov-Sinai and topological entropy. 

[143] Here we are implicitly using the Adler-Konheim-McAndrew formulation of topological entropy, which can however be 
shown equivalent to the Bowen-Dinaburg formulation touched on in appendix [B| 

[144] As a dynamical system, the GIK thermostat including an electric field E is defined by v = x and V„« = E — v{E,v)/{v,v). 
Here V denotes the covariant derivative. On a surface of constant negative curvature this is a (dissipative, reversible) 
Anosov system |128l 1129] . Physically, the electric field that E represents creates currents circulating around the holes 
in the surface the particle is restricted to. The thermostat itself maintains the kinetic energy of the system under the 
influence of E. Unless E has a global potential, the Anosov GIK thermostat is dissipative |13U] and (near equilibrium) 
has positive entropy production [131] . For a phenomenological analogue (i.e., a dissipative reversible perturbation) for 
the cat map, see [128] . 

[145] Compare this with the behavior for the Shannon entropy: S((t/m)® m ) = S(t) + logm. 

[146] By a result of Margulis, h(T) governs the growth of the number N(t) of periodic orbits of period < t when T is Anosov 
through the relationship htN(t)/e ht — > 1 |132] , (The idea is that an experimenter can distinguish sup B ( T k e AE(T, k,s)\ 
trajectories after k timesteps of evolution under T, so that h(e, T) describes the growth of distinguishable trajectories over 
long times.) It can also be shown that h(T) is the supremum of metric or Kolmogorov-Sinai entropies over the T-invariant 
Borel measures; by the Pesin-Ruelle inequality, the supremum is attained by a SRB measure. 

[147] An alternative choice for too motivated by recurrence considerations has been considered elsewhere, namely too = QZ\ + 
— 4a _1 (l — 6 2 ) -1 = 4a _1 cosh 2 (/3 fih) [133] . (As with /3fd, it would be appropriate here to require a = 1.) It is 
worth noting that here as in general there is a vast gap between recurrence and mixing or relaxation times. 



